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"Express Mail" mailing label numbe r EV 314842865 US 
Date of Deposit 1 October 2003 . 

REQUEST FOR FILING A CONTINUING PATENT APPLICATION UNDER 37 CFR § 1.53(b)(1) 
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Case No. 


ANTICIPATED CLASSIFICATION OF 
THIS APPLICATION 


PRIOR APPLICATION EXAMINER 


ART 
UNIT 


10466/Ji 5 


CLASS 


SUBCLASS 


Elizabeth Kemmerer 


1646 



Address to: 

Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 



This is a request for filing a continuation □ divisional application under 37 CFR § 1.53(b)(1), of pending prior 
application number 09/944,396 , filed on August 30. 2001 , entitled SECRETED AND TRANSMEMBRANE 
POLYPEPTIDES AND NUCLEIC ACIDS ENCODING THE SAME . 

1. Copy Of the Prior application, including 34 sheets of drawings, 152 pages of Application 

2. S Copy of the Declaration filed in the Prior application. 

3. □ PTO Form 1449 and Information Disclosure Statement. 



CLAIMS (1)FOR 



(2) NUMBER FILED | (3) NUMBER EXTRA | (4) RATE (5) CALCULATIONS 



TOTAL CLAIMS 
(37 CFR 1.16(c)) 



20-20 = 



x$18 = 



INDEPENDENT CLAIMS 
(37 CFR 1.16(b)) 




x$84 = 



MULTIPLE DEPENDENT CLAIMS (if applicable) (37 CFR 1.16(d)) 



+ $280 



$ 0 



BASIC FEE 
(37 CFR 1.16(a)) 



$770 



Total of above Calculations = 



Reduction by 50% for filing by small entity (Note 37 CFR 1 .9, 1 .27, 1 .2 8) 



$ 



TOTAL : 



$770.00 



4. □ A verified statement to establish small entity status under 37 CFR 1 .9 and 1 .27 

□ is enclosed. 

□ was filed in prior application number and such status is still proper and desired (37 CFR 1 .28(a)). 

5. Th e Commissioner is hereby authorized to charge any fees which may be required under 37 CFR 1 .16 and 
1.17, or credit any overpayment to Deposit Account No. 23-1925. A duplicate copy of this sheet is enclosed. 

6. H Enclosed is a check for $770.00 to cover the filing fees. 

7. ^ Cancel in this application original claims 1^1 of the prior application and otherwise enter the attached 

preliminary amendment before calculating the filing fee. 



8. |3 The inventor(s) of the invention being claimed in this application is(are): Kevin P. Bake r. David Botstein. Dan 

L. Eaton. Napoleone Ferrara. Ellen Ftlvaroff. Mary E. Gerritsen. Audrey Goddard. Paul J. Godowski, J. 
Christopher Grimaldi, Austin I. Gumev. Kenneth J. Hillan, Ivar J. Kliavin, Mary A. Napier. Margaret Ann Roy. 
Daniel Tumas. William 1. Wood . 

9. □ This application is being filed by less than all the inventors named in the prior application. In accordance 

with 37 CFR 1.63(d)(2), the Commissioner is requested to delete the name(s) of the following person or 
persons who are not inventors of the invention being claimed in this application: . 



Rev. Oct.-Ol 

C:\ALL\CNK\Continuations\Genentech l0466-484.doc 



10. Amend the specification by inserting before the first line the sentence: "This application is a S continuation 

□ division of application number 09/944,396 , filed This application is a continuation of. and claims priority 
under 35 USC Section 120 to. US Application 09/866,028 filed 5/25/2001. which is a contin uation of, and 
claims priority under 35 USC Section 120 to, PCT Application PCT/US99/28301 filed 12/1/1999, which 
claims priority under 35 USC Section 119 to US Provisional Application 60/113.29 6 filed 12/22/1998. where 
PCT/US99/28301 is a continuation-in-part of. and claims priority under 35 USC Section 120 to. US 
Application 09/254.311 filed 3/3/1999. now abandoned, which is the National Stag e filed under 35 USC 
Section 371 of PCT Application PCT/US98/25108 filed 12/1/1998, which claims priority under 35 USC 
Section 119 to US Provisional Application 60/069.873 filed 12/17/1997. " 

11. □ New formal drawings are enclosed. 

12. □ Priority of foreign application number , filed on respectively in is claimed under 35 

U.S.C.119. 

□ The certified copy has been filed in prior application number , filed 

13. A preliminary amendment is enclosed. 

14. E| The prinr application is assigned of record to GENENTECH, INC . 

15. ^ Also enclosed: Request To Use Computer Readable Form Of Sequence Listing F rom Parent Application 

Pursuant to 37 C.F.R. Section 1.821(e): and Copy of Sequence Listing (72) pages . 

16. S The power of attorney in the prior application is to: K. Shannon Mrksich, Reg. No. 36.675 and other 

attorneys at the firm of BRINKS HOFER GILSON & LIONE. 

a. ^ The power appears in the original papers in the prior application. 

b. □ Since the power does not appear in the original papers, a copy of the power in the prior application 
is enclosed. 

c. K Address all future correspondence to: (may only be completed by applicant, or attorney or agent of 
record.) 

K. Shannon Mrksich 

BRINKS HOFER GILSON & LIONE 



P.O. BOX 10395 
CHICAGO, IL 60610 

(312)321-4200 ^JYyXUlL^ 

Date Signature 

Name: K. Shannon Mrksich 
Reg. No. 36,675 



□ Inventor(s) 

□ Assignee of complete interest 
g) Attorney or agent of record 

□ Filed under 37 CFR 1.34(a) 

Registration Number if acting under 37 CFR 1.34(a): 
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United States Rvtent and Trademark Office 



UNITED STATES DEPARTMENT OF COMMERCE 
United States Patent and Trademark Office 
Addiw: COMMISSIONER FOR PATENTS 
P.O. Box 1450 

Alexandria, Virginia 223 13- 1450 
www.aspto.gov 



APPL NO. 


FILING OR 371 
(c) DATE 


ART UNIT 


FIL FEE RECD 


ATTY. DOCKET NO 


DRAWINGS 


TOTCLMS 


IND CLMS 


10/677,669 


10/01/2003 


1646 


770 


10466/485 


34 


5 


1 



K. Shannon Mrksich 

BRINKS HOFER GILSON & LIONE 

P.O. BOX 10395 

CHICAGO, IL 60610 



BRINKS HOFER GILSON & LIONE 

ECEIVE 




E C E J V E 

U. S. DOCKET 



CONFIRMATION NO. 1088 
FILING RECEIPT 

I II 1 1 II II II II I 

*OC00000001 1429949* 



Date Mailed: 12/05/2003 



Receipt is acknowledged of this regular Patent Application. It will be considered in its order and you will be 
notified as to the results of the examination. Be sure to provide the U.S. APPLICATION NUMBER, FILING DATE, 
NAME OF APPLICANT, and TITLE OF INVENTION when inquiring about this application. Fees transmitted by 
check or draft are subject to collection. Please verify the accuracy of the data presented on this receipt. If an 
error is noted on this Filing Receipt, please write to the Office of Initial Patent Examination's Filing 
Receipt Corrections, facsimile number 703-746-9195. Please provide a copy of this Filing Receipt with the 
changes noted thereon. If you received a "Notice to File Missing Parts" for this application, please submit 
any corrections to this Filing Receipt with your reply to the Notice. When the USPTO processes the reply 
to the Notice, the USPTO will generate another Filing Receipt incorporating the requested corrections (if 
appropriate). 

Applicant(s) 

Kevin P. Baker, Darnestown, MD; 
David Botstein, Belmont, CA; 
Dan L. Eaton, San Rafael, CA; 
Napoleone Ferrara, San Francisco, CA; 
Ellen Filvaroff, San Francisco, CA; 
Mary E. Gerritsen, San Mateo, CA; 
Audrey Goddard, San Francisco, CA; 
Paul J. Godowski, Burlingame, CA; 
J. Christopher Grimaldi, San Francisco, CA; 
Austin L. Gurney, Belmont, CA; 
Kenneth J. Hillan, San Francisco, CA; 
Ivar J. Kljavin, Pacifica, CA; 
Mary A. Napier, Hillsborough, CA; 
Margaret Ann Roy, San Francisco, CA; 
Daniel Tumas, Orinda, CA; 
William I. Wood, Hillsborough, CA; 



Assignment For Published Patent Application 

GENENTECH, INC.; 



Domestic Priority data as claimed by applicant 

This application is a CON of 09/944,396 08/30/2001 
and is a CON of 09/866,028 05/25/2001 PAT 6,642,360 
which is a CON of PCT/US99/28301 12/01/1999 
which claims benefit of 60/113,296 12/22/1998 
and is a CIP of 09/254,311 03/03/1999 ABN 
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which is a 371 of PCT/US98/25108 12/01/1998 
which claims benefit of 60/069,873 12/17/1997 

Foreign Applications 

If Required, Foreign Filing License Granted: 12/05/2003 
Projected Publication Date: 03/18/2004 
Non-Publication Request: No 
Early Publication Request: No 

Title 

Secreted and transmembrane polypeptides and nucleic acids encoding the same 

Preliminary Class 

530 



LICENSE FOR FOREIGN FILING UNDER 
Title 35, United States Code, Section 184 
Title 37, Code of Federal Regulations, 5,11 & 5.15 

GRANTED 

The applicant has been granted a license under 35 U.S.C. 184, if the phrase "IF REQUIRED, FOREIGN FILING 
LICENSE GRANTED" followed by a date appears on this form. Such licenses are issued in all applications where 
the conditions for issuance of a license have been met, regardless of whether or not a license may be required as 
set forth in 37 CFR 5.15. The scope and limitations of this license are set forth in 37 CFR 5.15(a) unless an earlier 
license has been issued under 37 CFR 5.15(b). The license is subject to revocation upon written notification. The 
date indicated is the effective date of the license, unless an earlier license of similar scope has been granted 
under 37 CFR 5.13 or 5.14. 

This license is to be retained by the licensee and may be used at any time on or after the effective date thereof 
unless it is revoked. This license is automatically transferred to any related applications(s) filed under 37 CFR 
1.53(d). This license is not retroactive. 

The grant of a license does not in any way lessen the responsibility of a licensee for the security of the subject 
matter as imposed by any Government contract or the provisions of existing laws relating to espionage and the 
national security or the export of technical data. Licensees should apprise themselves of current regulations 
especially with respect to certain countries, of other agencies, particularly the Office of Defense Trade Controls, 
Department of State (with respect to Arms, Munitions and Implements of War (22 CFR 121-128)); the Office of 
Export Administration, Department of Commerce (15 CFR 370.10 (j)); the Office of Foreign Assets Control, 
Department of Treasury (31 CFR Parts 500+) and the Department of Energy. 

NOT GRANTED 

No license under 35 U.S.C. 184 has been granted at this time, if the phrase "IF REQUIRED, FOREIGN FILING 
LICENSE GRANTED" DOES NOT appear on this form. Applicant may still petition for a license under 37 CFR 
5.12, if a license is desired before the expiration of 6 months from the filing date of the application. If 6 months 
has lapsed from the filing date of this application and the licensee has not received any indication of a secrecy 
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order under 35 U.S.C. 181, the licensee may foreign file the application pursuant to 37 CFR 5.15(b). 
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United States Patent and Trademark Office OG Notices: 18 March 2003 

Claiming the Benefit of a Prior-Filed Application under 
35 U.S.C. 119(e), 120, 121, and 365(c) 

Summary: This notice clarifies how benefit claims 

under 35 U.S.C. 119(e), 120, 121 and 365(c) must be presented in 
applications in order to be in compliance with the relevant statute and 
patent regulations, and accepted by the United States Patent and 
Trademark Office (Office) . 

35 U.S.C. 120 Benefit Claims 

Benefit claims under 35 U.S.C. 120 must include a 
specific reference to the earlier filed (nonprovisional ) application 
for which a benefit is sought. A "specific reference" requires: (1) 
the identification of the prior (nonprovisional) application by 
application number; and (2) an indication of the 
relationship between the nonprovisional applications, except 
for the benefit claim to the prior application in a continued 
prosecution application (CPA) . The relationship between any two 
nonprovisional applications will be an indication that the later-filed 
nonprovisional application is either a continuation, divisional, or 
continuation-in-part of the prior-filed nonprovisional application. 
When there are benefit claims to multiple prior nonprovisional 
applications (e.g., a string of prior nonprovisional 
applications), the relationship must include an identification of 
each nonprovisional application as either a continuation, 
divisional, or continuation-in-part application of a specific prior 
nonprovisional application for which a benefit is claimed. The 
identification is needed in order to be able to verify if copendency 
exists throughout the entire chain of prior nonprovisional 
applications . 

35 U.S.C. 119(e) Benefit Claims 

Benefit claims under 35 U.S.C. 119(e) must include a 
specific reference to the earlier filed provisional application for 
which a benefit is sought. A specific reference in this situation, 
however, only requires identification of the prior provisional 
application by the application number. No relationship between the 
subject nonprovisional application and the prior provisional 
application should be specified. If the subject nonprovisional 
application was not filed within twelve months of the filing date of 
the provisional application, the subject application must also include 
a benefit claim under 35 U.S.C. 120 to an intermediate prior 
nonprovisional application that was filed within twelve months of the 
filing date of the provisional application. Therefore, in addition to 
the identification of the provisional application, the proper benefit 
claim for this situation must also identify the intermediate prior 
nonprovisional application that is directly claiming the benefit of the 
provisional application, and indicate the relationship between the 
nonprovisional applications (e.g., an indication that the 
subject application is a continuation of the intermediate prior 
nonprovisional application) . 

Statement of the Problem 
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Background: Some applicants have been 

submitting patent applications which include, or are amended to 
include, at the beginning of the specification, a statement that 
benefits are claimed under 35 U.S.C. 119(e) and/or 120 to prior 
applications, followed by a listing of many prior nonprovisional and 
provisional applications. The listings do not indicate: (1) the 
specific relationship (i.e., continuation, divisional, or 
continuation-in-part) between the nonprovisional applications, as 
required by 37 CFR 1 . 78 (a) (2) (i) ; and/or (2) each nonprovisional 
application which is directly claiming priority to a provisional 
application. Without such information, the Office does not have 
sufficient information to enter the benefit claims into the Office's 
computer database. 

When entering benefit claims for an application into the Office's 
database, the relationship (i.e., continuation, divisional 
or continuation-in-part) between the nonprovisional applications is 
required. Further, the system will not accept any benefit claim to a 
provisional application if the provisional application was filed more 
than twelve months prior to the filing date of the subject application, 
unless the applicant clearly identifies, and claims the benefit of, a 
prior nonprovisional application that was filed within twelve months of 
the filing date of the provisional application. Accordingly, if benefit 
claims are presented without all the required information, the Office 
will not be able to enter such benefit claims into the Office's 
database, the filing receipt will not reflect the prior application (s ) , 
and the projected publication date will not be calculated as a function 
of an earlier application's filing date. 

The specific relationships between (each of) the nonprovisional 
applications in a chain of nonprovisional applications are also 
important because such information will be printed in the application 
publication, and/or patent. Furthermore, the designation of an 
application as a continuation (rather than as a continuation-in-part) 
is an indication that the entire invention claimed in an application 
has support in the prior application, whereas the designation of an 
application as a continuation-in-part is an indication that the claimed 
invention is not entirely supported by the prior application. Thus, the 
specific relationship between nonprovisional applications in a chain of 
benefit claims, and the indication of the specific nonprovisional 
application (s) that is directly claiming the benefit of a provisional 
application, will provide the information that is needed by examiners 
and the public in order to determine the effective prior art date of 
the application publication, or patent, under 35 U.S.C. 102(e). 

When benefit claims are required to, but do not, include: (1) an 
identification of (all) intermediate benefit claims, and/or (2) the 
relationship between nonprovisional applications, the Office may not be 
able to publish applications promptly after the expiration of a period 
of eighteen months from the earliest filing date for which a benefit is 
sought under title 35, United States Code (eighteen-month publication), 
nor have the accuracy desired of such benefit claims in application 
publications. Further, the objection (by the Office), correction (by 
applicant), and review/entry of changes (by the Office) cycle for 
non-compliant benefit claims is a burdensome effort on both applicants 
and the Office that can be totally avoided if such benefit claims are 
properly submitted the first time. Accordingly, it is hoped that 
applicants will submit benefit claims with all the required information 
as set forth in this notice and, correspondingly, avoid submitting 
non-compliant benefit claims that leads to extra work for both the 
Office and applicants. 

Procedures for Making Proper Benefit Claims 
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Part I: Reference to Prior Nonprovisional Application (s) 

Per 37 CFR 1 . 78 (a ) ( 2 ) ( i ) Must Include Identification of, and 

Relationship Between, Applications 

35 U.S.C. 120 provides that no application shall be entitled 
to the benefit of an earlier filed application unless it contains, or 
is amended to contain, a specific reference to the earlier filed 
application. The specific reference required by 35 U.S.C. 120 is the 
reference required by 37 CFR 1.78(a)(2). 37 CFR 1 . 78 (a) (2 ) { i) 
requires that any nonprovisional application that claims the 
benefit of one or more prior-filed copending nonprovisional 
applications, or international applications designating the United 
States, must contain, or be amended to contain, a reference to 
each such prior-filed application, identifying it by application number 
(consisting of the series code and serial number), or 
international application number and international filing date, 
and indicating the relationship of the applications. 37 CFR 
1 . 78 (a) (2) (iv) also provides that a request for a continued prosecution 
application (CPA) under 37 CFR 1.53(d) is the specific reference 
required under 35 U.S.C. 120 to the prior-filed application. 
Therefore, except for the benefit claim to the prior-filed 
application in a CPA, benefit claims under 35 U.S.C. 120, 
including claims under 35 U.S.C. 121 and 365(c), must not 
only identify the earlier application by application number, or by 
international application number and international filing date, 
but they must also indicate the relationship between the applications. 

Examples 

The relationship between the applications is whether 
the subject application is a continuation, divisional, or 
continuation-in-part of a prior-filed nonprovisional application. An 
example of a proper benefit claim is: "This application is a 
continuation of Application No. 10/ - , filed - . " A benefit claim that 
merely states: "This application claims the benefit of Application 
No. 10/ - , filed - ." does not comply with 37 CFR 1 . 78 (a) ( 2 ) ( i ) , since 
the relationship between the applications, is not stated. In 
addition, a benefit claim that merely states: "This application is a 
continuing application of Application No. 10/ - , filed - ." does not 
comply with 37 CFR 1 . 78 (a) ( 2 ) ( i ) since the proper relationship, 
which includes the type of continuing application, is 
not stated. It is also noted that the status of each nonprovisional 
parent application (if it is patented or abandoned) should also be 
indicated, following the filing date of the parent nonprovisional 
application. An example of a proper benefit claim of a prior national 
stage of an international application is "This application is a 
continuation of U.S. Application No. X, which is the National Stage of 
International Application No. PCT/US - / - , filed - . For additional 
examples of proper benefit claims, see Manual of Patent Examining 
Procedure (8th ed., August 2001) (MPEP), Section 

201.11, Reference to First Application. Section 201.11 of the MPEP will 
be revised in the upcoming revision to reflect the clarification made 
in this notice about the required manner of making proper claims for 
the benefit of prior nonprovisional and provisional applications. 

As stated previously, to specify the relationship between the 
nonprovisional applications, applicant must specify whether the subject 
application is a continuation, divisional, or continuation-in-part of 
the prior nonprovisional application. Note that the terms are 
exclusive. An application cannot be, for example, both a continuation 
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and a divisional, or a continuation and a continuation-in-part, of the 
same parent application. Moreover, if the benefit of more than one 
nonprovisional parent application is claimed, the relationship must 
include an identification of each nonprovisional application as a 
continuation, divisional, or continuation-in-part application of the 
immediate prior nonprovisional application for which a benefit is 
claimed in order to establish co-pendency throughout the entire chain 
of prior-filed parent nonprovisional applications. For example, the 
following two statements are improper: "This application claims the 
benefit of Application Nos. C, B, and A." and "This application is 
a continuing application of Application Nos. C, B, and A." On the 
other hand, the following statement is proper and acceptable: "This 
application is a continuation of Application No. C, filed - , which is 
a continuation of Application No. B, filed - , which is a continuation 
of Application No. A, filed - ." 

Sanctions for Making Improper Benefit Claims to 
Nonprovisional Applications 

Any benefit claim under 35 U.S.C. 120, 121 or 365(c) that 
does not identify a prior application and also specify a 
relationship between each of the applications will not be in 
compliance with 37 CFR 1 . 78 (a) (2) (i) , and will not be considered to 
contain a specific reference to a prior application as required by 35 
U.S.C. 120. Such a benefit claim will not be recognized by the 
Office and. will not be included on the filing receipt for the 
application, even if the claim appears in the first sentence of the 
specification or an application data sheet (37 CFR 1.76), because the 
Office does not have sufficient information to enter the benefit claim 
into the Office's database. As a result, publication of the application 
will not be scheduled as a function of the prior application's filing 
date. The Office plans to notify applicants on, or with, the filing 
receipt that a benefit claim may not have been recognized because it 
did not include the proper reference. Applicants are advised that only 
the benefit claims that are listed on the filing receipt have been 
recognized by the Office. Since the filing receipt and the notification 
will usually be provided to the applicant shortly after the filing of 
the application, applicants should have sufficient opportunity to 
submit the proper benefit claims within the time period set in 37 CFR 
1.78 (a) (2) (ii) and thus avoid the need to submit a petition under 37 
CFR 1.78(a)(3) and the surcharge set forth in 37 CFR 1.17(t). Failure 
to timely submit the reference required by 37 CFR 1.78(a) (2) (i) is 
considered a waiver of any benefit claim under 35 U.S.C. 120, 121 or 
365(c) unless a petition to accept an unintentionally delayed claim 
under 37 CFR 1.78(a)(3), the surcharge set forth in 37 CFR 1.17(t), and 
the required reference, including the relationship of the 
applications (unless previously submitted) are filed. For example, 
if a benefit claim is submitted without the specific relationship 
between the nonprovisional applications before the expiration of the 
period, and the specific relationship between the nonprovisional 
applications is subsequently submitted after the expiration of the 
period, a petition and the surcharge would be required. 

Part II: Reference to Prior Provisional Application (s ) Per 

37 CFR 1.78 (a) (5) (i) Should Only Include Identification of Prior 

Provisional Application (s ) 

When the domestic benefit of a prior provisional application is 
being claimed under 35 U.S.C. 119(e), however, the relationship 
between the two applications should not be specified. 35 
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U.S.C. 119(e) provides that a nonprovis ional application claiming 
the benefit of a provisional application must be filed within twelve 
months of the provisional application and must contain, or be amended 
to contain, a specific reference to the provisional application. The 
specific reference required by 35 U.S.C. 119(e) is the reference 
required by 37 CFR 1.78(a)(5). 37 CFR 1 . 78 (a) (5) (i) requires that any 
nonprovisional application, or international application designating 
the United States, claiming the benefit of one or more prior-filed 
provisional applications must contain, or be amended to contain, a 
reference to each such prior-filed provisional application identifying 
it by provisional application number. No relationship should be 
specified whenever a claim for the benefit of a provisional application 
under -35 U.S.C. 119(e) is made. 

If a relationship between a nonprovisional application and a prior 
provisional application is submitted, however, it may be unclear 
whether applicant wishes to claim the domestic benefit of the 
provisional application under 35 U.S.C. 119(e), or the benefit of an 
earlier application's filing date under 35 U.S.C. 120. Thus, 
applicants seeking to claim the domestic benefit of a provisional 
application under 35 U.S.C. 119(e) should not state that the 
application is a "continuation" of a provisional application, nor 
should it be stated that the application claims benefit under 35 U.S.C. 

120 of a provisional application. If such a claim is submitted in an 
application transmitted to the Office other than through the Electronic 
Filing System, it will be entered into the Office computer system as a 
claim to the "benefit" of the provisional application. Although 35 
U.S.C. 120 does not preclude a benefit claim to a provisional 
application (that is, one could obtain the benefit under 35 U.S.C. 
120 -of a prior filed provisional application), such a benefit claim 
under 35 U.S.C. 120 is not recommended as such a claim 
may have the effect of reducing the patent term, as the term of a 
patent issuing from such an application may be measured from the filing 
date of the provisional application pursuant to 35 U.S.C. 154(a)(2). 
Instead, applicants should state "This application claims the benefit 
of U.S. Provisional Application No. 60/ - , filed - ", or "This 
application claims the benefit of U.S. Provisional Application No. 
60/ - , filed - , and U.S. Provisional Application 60/ - , filed - 
See MPEP 201.11, Reference to First Application 
(8th ed., August 2001). 

Part III: If Benefit is Claimed of a Prior Provisional 
Application Which was Filed More Than One Year Before the Subject 
Application, Then Each Prior Nonprovisional Application (s ) Claiming 
Benefit of the Provisional Must be Specified 

Any nonprovisional application that directly claims the benefit 
of a provisional application under 35 U.S.C. 119(e) must be filed 
within twelve months from the filing date of the provisional 
application. As noted above, an application that itself directly claims 
the benefit of a provisional application should identify, but not 
specify the relationship to, the provisional application. If the 
subject nonprovisional application is not filed within the twelve month 
period, however, it still may claim the benefit of the provisional 
application via an intermediate nonprovisional application under 35 
U.S.C. 120. The intermediate nonprovisional application must have 
been filed within twelve months from the filing date of the provisional 
application and the intermediate nonprovisional application must have 
claimed the benefit of the provisional application. Further, it must be 
clearly indicated that the intermediate nonprovisional application is 
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claiming the benefit under 35 U.S.C. 119(e) of the provisional 
application. This identification of the intermediate nonprovisional 
application is necessary so that the Office can determine whether the 
intermediate nonprovisional application was filed within twelve months 
of the filing date of the provisional application, . and thus, whether 
the benefit claim is proper. 

Examples 

Applicant should state such a benefit claim as follows: "This 
application is a continuation of Application No. C, filed - , which is a 
continuation of Application No. B, filed - , which claims the benefit of 
U.S. Provisional Application No. A, filed - . " A benefit claim that 
merely states "This application claims the benefit of nonprovisional 
Application No. C, filed - , nonprovisional Application No. B, filed - , 
and provisional application No. A, filed - " would be improper where 
the subject application was not filed within twelve months of the 
provisional application. 

Where the benefit of more than one provisional application is being 
claimed, the intermediate nonprovisional application (s ) claiming the 
benefit of each provisional application must be clearly indicated. 
Applicant should state, for example, "This application is 
continuation of Application No. D, filed - , which is a 
continuation-in-part of Application No. C, filed - , Application No. D 
claims the benefit of U.S. Provisional Application No. B, filed - , and 
Application No. C claims the benefit of U.S. Provisional Application 
No. A, filed - ."An example of a proper benefit claim of a prior 
national stage of an international application, which claims the 
priority to a provisional application, is "This application is a 
continuation of U.S. Application No. Y, which is the National Stage of 
International Application No. PCT/US - / - , filed - , which claims the 
benefit under 35 U.S.C. 119(e) of U.S. Provisional Application X, 
filed - . " 

Sanctions for Making Improper Benefit Claims to 
Provisional Applications 

If a benefit claim to a provisional application is submitted 
without an indication that an intermediate nonprovisional application 
directly claims the benefit of the provisional application and the 
instant nonprovisional application is not filed within the twelve month 
period, or the relationship between nonprovisional applications is not 
indicated, the Office will not have sufficient information to enter the 
benefit claim into the computer database. Therefore, the Office will 
not recognize such a benefit claim, and will not include the 
benefit claim on the filing receipt. The Office plans to notify 
applicants on, or with, the filing receipt that a benefit claim may not' 
have been recognized because information regarding the intermediate 
nonprovisional application (s ) and/or the relationship between each 
nonprovisional application have not been provided. Applicants are 
advised that only the benefit claims that are listed on the filing 
receipt have been recognized by the Office. Since the filing receipt 
and the notification will usually be provided to the applicant shortly 
after the filing of the application, applicants should have sufficient 
opportunity to submit the proper benefit claims within the time period 
set in 37 CFR 1.78(a) and thus avoid the need to submit a petition 
under 37 CFR 1.78(a) and the surcharge set forth in 37 CFR 1.17(t). 
Failure to timely submit the reference required by 37 CFR 1.78(a) is 
considered a waiver of any benefit claim under 35 U.S.C. 119(e), 
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120, 121 or 365(c) unless a petition under 37 CFR 1.78(a), the 
surcharge set forth in 37 CFR 1.17(t), identification of the 
intermediate nonprovisional application which claims the benefit to the 
provisional application, and the relationship between each 
nonprovisional application are filed. 

Part IV: Office Practice to Not Require Petition and 

Surcharge if Benefit Claim is Not Present in the Proper Place But is 
Recognized By Office Continues But Applicants Are Advised That Proper 
Reference Must be Presented 

The reference required by 37 CFR 1.78(a)(2) or (a) (5) must be 
included in ah application data sheet (37 CFR 1.76), or the 
specification must contain, or be amended to contain, such reference in 
the .first" sentence following the title. Previously, the Office 
indicated that if an applicant includes a benefit claim in the 
application but not in the manner specified by 37 CFR 1.78(a) 
(e.g., if the claim is included in an oath or declaration 
ot the application transmittal letter) within the time period set forth 
in 37 CFR: 1.78 (a) , the Of fice will not require a petition under 37 CFR 
1.78(a) and the surcharge under 37 CFR 1.17(t) to correct the claim if 
the information concerning the claim was recognized by the Office as 
shown by its inclusion on the filing receipt. If, however, a claim is 
included elsewhere in the application and not recognized by the Office 
as shown by its absence on the filing receipt, the Office will require 
a petition and the surcharge to correct the claim. See 
Requirements for Claiming the Benefit of Prior-Filed Applications 
Under Eighteen-Month Publication Provisions, 66 Fed. Reg. 67087, 
67089-90 (Dec. 28, 2001). The Office will continue to follow this 
practice . 

Sanctions for Making Improper Benefit Claims 

Applicants are simply being advised by this notice that the 
Office will not recognize any benefit claim where there is no 
indication of the relationship between the nonprovisional applications, 
or no indication of the intermediate nonprovisional application that is 
directly claiming the benefit of a provisional application. Applicants 
are also reminded that, even if the Office has recognized a benefit 
claim that includes the proper reference by entering it into the 
Office's database and including it on applicant's filing receipt, the 
benefit claim is not a proper benefit claim under 35 U.S.C. 119(e) 
and/or 35 U.S.C. 120, and 37 CFR 1.78, unless the reference is 
included in an application data sheet, or the first sentence of the 
specification, and all other requirements are met. 

Part V: Correcting or Adding a Benefit Claim After Filing 

The Office will not grant a request for a corrected filing 
receipt to include a benefit claim unless a proper reference to the 
prior application(s) is included in the first sentence of the 
specification, or an application data sheet, within the time period 
required by 37 CFR 1.78(a). Any request for corrected filing receipt to 
include a corrected or added benefit claim must be submitted within the 
time period required by 37 CFR 1.78(a) and be accompanied by an 
amendment to the specification, or an application data sheet. If the 
proper reference was previously submitted, a copy of the amendment, the 
first page of the specification, or the application data sheet, 
containing the claim should be included with the request for corrected 
filing receipt. The Office plans to notify applicants on, or with, the 
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filing receipt that a benefit claim may not have been recognized 
because it did not include the proper reference. Applicants are advised 
that only the benefit claims that are listed on the filing receipt have 
been recognized by the Office. Since the filing receipt and the 
notification will usually be provided shortly after the filing of the 
application, applicants should have sufficient opportunity to submit 
the proper benefit claims within the time period set in 37 CFR 1.78(a) 
and thus avoid the need to submit a petition under 37 CFR 1.78(a) and 
the surcharge set forth in 37 CFR 1.17(t). Therefore, applicants 
should carefully and promptly review their filing receipts in order to 
avoid the need for a petition and the surcharge. 

When an unintentionally delayed benefit claim is submitted with a 
petition under 37 CFR 1.78(a) and the surcharge set forth in 37 CFR 
1.17(t), the benefit claim must include a proper reference to the prior 
application(s) in order for the petition to be granted. The reference 
to the prior application (s ) must include: (1) the relationship between 
nonprovisional applications (i.e., continuation, 

divisional, or continuation-in-part), and (2) the indication of any 
intermediate application that is directly claiming the benefit of a 
provisional application, in order to establish copendency throughout 
the entire chain of prior applications. 

Applicants are also reminded that, if an amendment to the 
specification, or an application data sheet (ADS), is submitted in an 
application under final rejection, the amendment or ADS must be in 
compliance with 37 CFR 1.116. The amendment or ADS filed in an 
application under final rejection will not be entered as a matter of 
right. See MPEP 714.12 and 714.13. Therefore, 

applicants should consider filing a request for continued examination 
(RCE) (including fee and submission) under 37 CFR 1.114 with the 
petition to accept an unintentionally delayed benefit claim, the 
surcharge, and an amendment that adds the proper reference to the first 
sentence of the specification or an ADS. 

Part VI: Each Intermediate Prior Application Must Have Proper Reference 

If the benefit of more than one prior application is claimed, 
applicant should also make sure that the proper references are made in 
each intermediate nonprovisional application in the chain of prior 
applications. If an applicant desires, for example, the following 
benefit claim: "This application is a continuation of Application No. 
C, filed - , which is a continuation of Application No. B, filed - , 
which claims the benefit of U.S. Provisional Application No. A, 
filed - ," then Application No. C must include a benefit claim 
containing a reference to Application No. B and provisional Application 
No. A, and Application No. B must include a benefit claim containing a 
reference to provisional Application No. A. 

Part VII: Adding an Incorporation-By-Reference Statement in 
a Benefit Claim is Not Permitted After Filing 

An incorporation-by-reference statement added after the filing 
date of an application is not permitted because no new matter can be 
added to an application after its filing date. See 35 
(J.S.C. 132(a). If an incorporation-by-reference statement is 
included in an amendment to the specification to add a benefit claim 
after the filing date of the application, the amendment would not be 
proper. When a benefit claim is submitted after the filing of an 
application, the reference to the prior application cannot include an 
incorporation-by-reference statement of the prior application. 
See Dart Industries v. Banner, 636 F.2d 684, 
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207 USPQ 273 (C.A.D.C. 1980). Therefore, the Office will not grant a 
petition to accept a benefit claim that includes an 

incorporation-by-reference statement of a prior application, unless the 
incorporation-by-reference statement was submitted on filing of the 
application. 

Inguiries regarding this notice should be directed to Eugenia A. Jones 
or Joni Y. Chang, Legal Advisors, Office of Patent Legal 
Administration, by telephone at (703) 305-1622. 

February 24, 2003 STEPHEN G. KUNIN 



Deputy Commissioner for 
Patent Examination Policy 
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DECLARATION OF AUDREY P. GODDARD, Ph.P UNDER 37 CF.R. S 1.132 

Assistant Commissioner of Patents 
Washington, D C. 20231 



Sir 

I, Audrey D. Goddard, Ph D. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2 . Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 



Serial No.: * 
Filed: * 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (Le., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PGR assay is described, for example, in the following scientific 
publications: Higuchi et al. y Biotechnology 10:413-417 (1992) (Exhibit B); Livak et aL 9 PCR 
Methods AppL 4:357-362 (1995) (Exhibit C) and Heid et aL, Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica etaL, Proc. 
Natl. Acad. Sci. USA 95(25): 14717-14722 (1998) (Exhibit E); Pitti et aL, Nature 
396(6712):699-703 (1998) (Exhibit F) and Bieche etaL, Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et aL have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et aL studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et aL used the assay to study gene amplification in breast cancer. 



Serial No,: * 
Filed:* 

7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 

Date 



Audrey D Goddard, Ph.D. 



-3- 
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i'both aimptifiec and impress 
|nny facilitate i« automation and more 
Twidesaread use in the cUmc or m other 
etio^requiring high sample through- 



DUt 

8 . * UKovv* ch« pot-auaJ ben-Jits of PCR 1 «h 
K.A W cfopno.ci<» «e wdl known", a » a« 

>1 JL four yea« rind thermostable DNA 
^ made FCR practical. Some of the reasons for n dow 
mare r ^ r Jrr~, i-j, c f automation of pre- and 
ccpranrr arc high cat. »tx o* f roin 

fch-yovcr-cootanunation. The hnt wo po«^ « 

*Sa£ labor is the larcot ^™ buwr ;^ t 5l 1 ^*s £T uir e 

*}foe form of ' dowuaircani procouang » . . 
gk U oone to order ~ J-j^ «*JJ«J- 
PNA sequence was pracw »•« ^ ot 

ftctrophoreris' 9 . Thwc method* ar- ^*~-JK 
^ throughput, and arc dffieult » ™S» 
IMii U a Uo closely related to dovtutream pro^i^ 
Etht tuadliog or .he PCR product ,n ^^^^ 
Smce«es uJrea»« tbe chance, i that i^J ™£ ^ 
k >«d through the typing lab, r^ulunp ua a nu 



Th«c dow^^ r^f"^^^ 0 f amplified 
nated if apc-nc ^P^f^^t^nw unopened re- 

action ressrf. As>«ya « w ^°~l" reacUo „ componentt 

remold iaadowo«<r««« P"^^*™^ « »«ay * 

whjch the cadopanou* 9 labeled oHp>nadeo- 

PS ly ^ e ThcA^d^ycie^if^ 

tul e probe T^rt ^ ^ucae*. In 

UNA hindine dye: .exhibit vhen «icy, nro rnrynic PCR 
UNA 1 *" lB . A3 outlined in Tigur* 1. a prototypic 



*«e law 



/ 



»vta— - 




^iO K>0^d Pf) 

( . Fdndplc of il^- ^^qf^r^ 
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. expeaed 
product 



diog. 



^ U^f PGR ampliation products ofthc 

human, S<*f • "w U pco S ^ml). The P*? 0 ^? 

Uvcc^rtng am ^^°f^cf J^^dor sped*** 




r prgu thai conoid 
HCBU J (Ainuor=«oca »^=?^ r vSireinoMmc rep«i 

,e«nZ£ees Kverepbeuc: ?^«2^ „f fi-e .epHcaie 
Buorawenc. measured. Umo of ™^ff^ a rf- 3ir U, polypto- 



t<s lns with primerc « f^fe^S?^ 
SSTa contain!** 

*o typically f"^^^^'™ 
on the appUcaaon, froj> pracall «h« realms 

micrograms p« PCR - » * : f \JL'^ in - fl uortS <*uce. are 
diaiwll fluoresce, m order of • .^te^Mled 

^ £t*r Itself. »*J^?SSSiSd Sfiot D»A(hy 

ice interealacion bee— en tne . 3 rL a£U j a doa eyde. target 
double-hellX). After the fi T"*^i After a PC* 'S 

completed. d» ^«g^SS5S» SttdW of up to 
the amount of dsDNA *^^M*£Sr » »o«n4 to Ox 
,everal micrograms. ^^Sl^aue * nuor«- 
•d&bnal diDNAj resute«6 ^Lg^gRE amount of 
«nce. There is also ^^^Mfc to »DNA 
jjDNA primer. buc ^^^SeI 0 f this change »o 
b much lc than » daDNA, « 0 W tmall The Adores- 



RESISTS „ t-s order go a*se*i chc 

affect Of Xarj»PCK,«^ dye present at 

D«« gene" -rf^to ug'ml (• tfK** <-» n '^ 
eooeennaoonj *ob,«J^»» of liocloc aod* following 

gel dectrophor«^5 A, m roe yleld 

Lctrophoro" ^^^XTp^ducc whether htK,- wa 
or quafity of the M *^^^eon«^tx»ooo«, indtot- 
abietn or present at any « "ese ™» 

l>etectu)n of !^Kc fluorejeence «nh"ioc.neQC of 

l^r a* » result of »2ro » i *aft* EtB * ^ d P timftrt 
.topliruanons ^L^i ^found on th« human 

Y-ehromosome . T; ne^ r . ^ Uln]m . 0 r oo DN A. 

60 n„ M LW feln ^'wKr each DNA. After 0. 
Hve replicate t ?CJU r^f^^diaff. a PGR for e«h 
17,21,^4 and and i» Ruo- 
DNA wa, removed from ^^^er and pWed 
rciccnce »««r«J In a Jg^p 3A) . The shn« ef thu 
vj.amplSotlon cycle numl^U'B^^ ; ■ xx 1r 
Sivc reflecw the face d»Jr .^ eisc ^ DNA tJ 
fluoraeeiie- can he ^ ^ ^ cle number 

becoming ^: r ^ c d " c^nl^ed about three-fold 
A3 shown, d>e a *««^S een0C for the conrain- 
over the jigaiftoitdy Increase. 

lng human male DNA^^C at *^ cd tiiae r oo 

to neeadvc ^crol v^cn^ roiJc UNA 

DMA or human f^gJ^vSu, 2 ng-^ fcwer 

cycto v-erc needed p^due^ ot these 

Sconce. Gel C^n2^>NA fragment, of the ex- I 
^pUncaOons sho **~i?i a ^1 male DNA contain'^ 
SS^'S.SfDKA ^ toohpUee m the 

-rac^C^cre^^ 
by .hn P V laying ^ .^ p ^ D Ca*em through a r«c 

fUtcr. Thi. « £ with no DNA- 

bcpin -nth a np maic nr'A ^. « ^ ;lo ^„ 

p^e- ln order to demon^e in* hi U ct=<:uor 

liquate ^acfieuy toantn- f ^^^ed. Kpir. 
of U« sicWVeeU anemU "J^TlSfiS amplification. 
* *"?"? ^SErWStf - ^ by ph P oro 6 raphr 

ther the wild-type « ^-^mutauon^^^ 



^obmgcne^^espeato^foreoehauai 

nudeoildcof on. pnmer. ny "flnE^P^.^ oius >f 
annealmg temperature^ L^WinbTs^o^ 1*? 
plificado«-<aa »1« place only ir tbe ^^1- 

P Each pair of a mpU6e»uoW tho«m « *jS£V ;fic ^ 

cube) or rtddc-JW* DNA from Vh^^a: 

different UNA- mw ' W*^ f^. f^ & h*camiygo*; 
-Ud-type P:^.^ d fi^d from a homoaye^ 
eieWe ^.^ffSWA C60 n R ge^ ot ^ 



414 KOTCCHNCXO©/ ^ol 10 AP(a ^' 92 



Ul/Z/VZUUO 10:^0 fAA OOU 0*4 UOOO 



e 



25°C 



Mf ce c ^ ^ m P iuSI on a !pec*ofluorofl. 
prune- ^L^^hiT^reJCenoc «i about thro* ume* 

5?* where both P-i^^^S 
th* P*" 6 "?'* pttoe r set Gel decsophorea* {»£ 
^ ma rd&Jwd Shis Srew -> ft-o^n^^" 
ehovm) ^"^LiTTf narfy a microgram of * D « A 

^niS 0 J SSSTm reason* in which the ailele- 
UtdeiVa^ e "-°^ ln i tmatc ne d to both allcleji 

wd2«t excitation Uluiranooon fror* 
and to rcrum ^ Tea d ou t ai such an inanccm****- 

™. scCCifiC scouenccs from 26 «tg 01 mjmannvHs > ■ ^"^^ fi-Jtobin acne. Th« '^"i*7°iG„ ^ right tube 

■ iiy rises and falls J*^*?*** ™. T e P aenacuradon W 

fluorescence maxima and minima ?_° f ° C .~"S: .V 0 ^ - u 
Wy over the thirty themoeydes. "S et 
fode d«UN'A synthesis ^n.^V^Sdwta* 
Ma. «d there b h«lc if any bUachinR of Mir durlnf 

.fce eonOnuous fUuminadoo of the sompje. 

*ln ^fpCK eontamtas male DNA, the fl"*^ 1 " 

& » abt.uc 4000 seconds of Aormoeycta 
anting to increase with time, :nieoUop «h*c « 
Sur product « a detectable «evcl. Now thai ih. fluo, 
rSe^nima at the denaturatlon temperature do£ot 
ft^tly increase, pretumably because ai d» 'temper- 

; fof the amplitieadon « Colloid by cracking *e*£^- 
H-< ■« hw«* 41 *• ann^ng temperature, of 
iT-tfcevroducts of cheee two amplifications by eel clectropho- 
PSSS a DN A fcoflnunt or the eg-g^gg- 
J .tale DMA containuu; sample and 00 detectable uma 
synthesis for the control sample. 

H^oSSSSi process such ee hybridtoMT 1 M II* 
iqtence-speo&c probe can cnh««c fcjW^ 
W,Watm by FCR. The cUminauon of cbe*« P r f"**5 
that the spedficity of this hc<ne*«neou assay 
Spends solely on that of PCR. lo the .ease of 
Uease. we have Shawn thai PGR ok™ bo. "A°«^ A . 
Ksequcnce specificicy co permit 
Nproprntt ampHtodon cossdSnon* ' there m iWde i non- 
tocori prodocfion of dsDNA in the ab«*nc<i of the 

PrT^ff^ to detect pathos canb^ 
a Vnre or Us* th» that required to do genetic jer^; 
!Ucpendit. ? or. the number of pathogens » the sample and, 
;4'He «ooum of other UNA that must be taken .with. *« 
K&iple. A difficult «re« it Hi", which rfapj-wde^t^; 
^P< a «iral genome tho? can he at the level of ai*" .<»P*? 
[•■per thcuS.de of host cells*. UJnraared 
*w«nin Kl which is performed on celb containing at leaat 

PM more spicifiacy aid the input t,f more total 
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20ng of mola 0NA 




2000 



8000 



4-000 6000 
time (acc) 

sutrSnt with SO n 8 ot human «3k »NM»j).^m of 
: PCR-«tho« UKAftotwort. ^5SSlmre «53S Utw«» 
PCR weir fonowed hJt ,3?*J^^d attol jloo). Not. in 

^^^t^eaL^aV Mr7 P crature. 
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or<^cr to have su£~ 1 
DNA— ap » microgram am0 ^~~^fl lQr g C amount of 

sorting DNA m no »*P£f S ^ f^tt 
the background f l ,lor ?t~vA> most bo detected- .An 
Sfuorec^nce produced ^ w to lo- 

,dd»do.iaI rampli«oo° t^t^ 1 ^. ^ "pruMr-dxmor 
cooy-flumher « &c forrnaoon —e^ioo. of one prtiacc 
This » ^^V^rf^cbouga^occun 
mmgdae Other pruaeraia^ini»- product Is » 

^uently. one* it <*?*l£? ^£ cUpetc vrich 
subsWK ^ ^./^^ £e rare, ?u£ prtaer- 
tm« P'.:h org™ »| IIdNA and cbu* U a poK^ 

so ur« of fclse agn J « ^^SdrSwc the effect of 
To increase PCR Satigedng a »u* 
primer-dueer ampbficeuo?. « « ™ 7=* ( Wd-primeT 
STof approaches, jndu^g w b,». and tM 

ajipBficauoni that take P*«> » » ^adon u reduced 
ToW. in C dT&,n before DNA 

by ^*8^^P|X^ remits using 

becoiU P™^ancJ-0 i^uccj* DNA-bindiuf dye* 

theoUgonudeoQdepnmcr-^ ^ of fluoro cence 
We have «hown Chat ^ . ^p u .n^ijihtforxard, 
KBCT-ed by an U ^3y during 

&h uncc PCR . corned rff 
thcrmocydms. Tb* ease vout » ^ tf the most 
UNA detect. « ^ */^gE«aC«n« analyaa 
promising ajpectof tins aflM^ eJ d.tiflg insmi- 
ot completed PCRa U ^^(PtlX" --- -w^.,«r«- 
mentation in 96-well format" . In this 
p , o/~d ran be o manure* 



tf a .ample 

£«r this many 9?t£S^^& « *• P rescn " 
j„ *k «*-?- * o^ita*. auch controls may 
or abteoa of 8uor«o*nce signal • fc ^ on this 
£ important. In wr «"« , h,«. ft ^ of its false 

p^le is ready for ^^SffiSU obtained using 

a Jsree number of kflovm «afflpl«_ ^ 
Ionian. d»eu^«wnm POT o* ay m3j:es 

powlble W detect specific DN ^mpon 

if^SaW ^d.rougtvput of 



A 




oL(.uti<f«< - Jtc i„ format, the nuOC es- 

reader*. . l0 co ntinuou«ly tnonitpr 

The UutruB«U»».«n no «"^ Kmple in principle. 

A dlr^cc extension of tbe *PF" a ^;J" • ^ and flU- 
or,.cint Emissions co and ^«2f ^,,)ta quan- 

to Monitor "^f* 'SSSKS ' » ^ 
omtion ot target DNA ^J !J~nr rtl I)N.V, die cooner 
^ larger the ^^^^Kcaed. -Pretinti- 
duriog PCR a a^rc' oen f i e *~? DoUincer. a»nu«ript ,n 

rST" ^rSdT^nc. in iniua! target ONA. 

tarpt moleeule«, ! true P^"S S eydes of PCR. . 
SoTeaeenee by a F^^^ ate ^ 

cycle would i^^^SaSo^f UNA polymer* 



Yl.Tl~<Vt.2 , "i» wl d,h£r 5 ? g ^L,' M«CTei- 1 min. and 8CC, , 

C,5t -- " —aline -mperaturc =f j5 ) ^ or . Ccr P £-J 

PMX «^JP»i,fSS 0048S an. pMff.WSgi 
about 2 ".^^^dide aecond/mder V^-.^S 

ut ^.uT^Sff fiW was "^£^1^. Co<v« 

^ptlc aocatiory (SrTXj «^ »* J p C R pl.ced in a «eU « 
UgSi. m d revive «2^ h *&SE£ C^uT). TM P"S« e J 

open top of "*rJt.T^ff«u,dy waline u. ^: c ^ rtidJ^ 



Sl "due to for ^£S^SZ^& 
ase. may be det^rf bytoctod^ ^ ^ 

meffidendy f»P^8 ' XaferVUrge ntimber df cy- 
ftuomcence incr«aM only alW a^ars a ^ 

eles _ inany more than are necessanr ™ 
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(miorotttorotnpe), 
procba»cJ wt^ a specific 
monoclonal antibody, 
2x1 hour incubation, 
standard range: 3-96 ng/ml 
detection limit 1 ng/mi 
CV: intra- and intera^say < 6% 



SCD44EUSA 

Trauma, Shook and Sep 

L co,. .cccute , expre^ on-ths ^ of,.. .jw^^^S^ 
- : monocytes and seme macrophagea. M«mdrar,e, ; ,. CD-14^ W^oioSoS flukto 

'•■{UPS) completed m LPS-eindmo-Protcn (LBP)..T>>f ^say } 

• eonr^ion of its soluble form i E altered;^ i™^!?^:^^ ! ill 

certain pathoteg.cal conditions. Tners Is evideiics'for • 

t an important roe of with porytraumai eeoeie, 

: bumings and intlamrrwtlona. 
. ]r Ounng eeptic conditions and acute Infections it aearija 0 f-^ 
fe to be » prognostic marker nnrl is therefore of value jn' - .* 5. , 
Tv. monitoring these patients. ... > . 

For more Ihfartnati on call ° r tot 
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Olioonuclcotides with Fluorescent Dyes at 
Spp^Se Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

K«nn.,h |. U«»k. J A. «.«t )«ff~J M.TOM.. Wl.ll.m Oiu.ti, .nd K.rin D««Z 

PorkJn-Clmcr. Ai«dlfd UHwy»wmj OW-on, V»>ier Uty. California 9*404 



The 5' nuct«w*o PC* d**«*=e* eh« 

Accumulation of specific PCR oro«J-«A 
by hybridisation and cUkb9< of ■ 
douhle-laheie«< fHionopcnlc probe 
during tba amplification reaction- 
The probe it an oil eon uel noil da with 
both a reporter fluorescent dye a 
quencher dye attached. An Increase 
In reporter fluorescence Intensity In- 
dicates till t the probe has Hybridized 
to the tare;*! PCU product <**d 
been cleaved by t*»e nucle- 
olytJc activity ef TaQ DNA polymerase, 
tn thl* itiidf- probes with the 
qyaneker d>~ *tt*chad to an Internal 
nucleotide ware compared wttft 
urol*«> with the quenchac- dye as> 
uched to the 3'-cnd nucleotide. In all 
cases, the reporter dye was attached 
to the £' end. All Intact probes 
showed quenching of the reporter 
fluorescence, In genera I, probes with 
the quencher dye attached to the 3> 
q ad nucleotide exhibited a larger sty.- 
,1*1 in the *' nucleate pch assay man 
the Internally labeled preba*. It u 
proposed that lU« larger tiff,"*! I* 
cauicd by Increased |lk«Uhood 
cleavage by Uq dha potymeraic 
wh«n the probe li hybridlied to o 
template strand dwt-too *CR. Pr-b*a 
With the gucnUier dye attached to 
*fc„ a'.-nd nucleotide a'so oxhlbttcd 
on Ucr^ate In reporUr fluorescence 
InWo.Uy wK,n tryWdlead to a corn- 
pkmonUry *U««d. Tl»«**. ollgonu*!*- 
ottdes with reporter «nd quencher 
dy«t Attached at opposite ends ean 
be und ai homooinvoui hybrldUe- 



jrV^icjmoceriCOu.r as*ay fur ovtocOnj; 
iiiii w^inhuiattosi uf spccllic I*<:k prod- 
uct u^s a dciublr-iabclcd fluor> 
Kphiiprbhe wai dw^bed by Leo ct ad/ 1 ' 
Ttc assay exploit! the S' • • 3' iiuelts- 
Ojyiic ; aciNUy.; «f :V«3 l>NA poly- 
nd Is dU^raiuvd In lieuve U 
"H»c lluoi'ogtn;c (utilje cunsttfty uf an oil- 
/fonudooUda wjlh u rvportttx fluorescent 
Ore. >uv It « « . fiuoiCbtvtn, arta«'hed to 
Hie 5 ; ciul; iiud u ^UCWthcr dye, sveh OS e 
rtioiinVihc, aiucicd Internally. When 
Hi* fluorescein Is excited by irradiation, 
lu fluorescent ctnl&slOit v/ill he 
quenched ii die jlmOumltie ts dp*« 
cnouith to be cxclled through \h< p/<- 
ccsi ? o l riu orescci cr kt «r tr« nM tr 
(H Eqr^ During PCH, If the profcc is liy- 
bridled lo i UnipUU ktiatiid. Tao DNA 
polymcnwc , will cieavc the probe b«- 
ciusc pf Us inhetenf A* 3' nueleul ytlc 
actiyUyr7f the ct<#avjujc occurt between 
OK^nuurcscein and rhodamlnc dya«. it 
causes a n incrcw in fl uo i stfvvtri fluores- 
cence Intrnsity because the ftuor««o»;n 
\s lio lohuer C|Uerict|CCl, The UiCietise in 
jnuurusceln fluoccscxnce intensity lndl* 
i but Xiic probv-spccUlC vCX ptoduOl 
has |»V»<^tHnWdCHl; r\\\m t PBThetwan n 
teiwuici dye and c ^uciU'htr dye Is 1*1*1- 
eai io Uic jw forma ulc uf Die jjivAic Iw 
the S' wucW^- l«CR oviay. 

j QucnrKin^ i« : .e«fnplelely dependent 
on f the..jVUy^! ^0*1!— iw ol lh« two 
dyes.* ^ Because of thla^ ll lias U^wn ■>« 
*%ii|^jAh«tvUh'c^quaiclicx- dye mu>l W 
ifliciiea Vneai tlie 5 f end. SuJiJiislusly, 
have found Ihftt sHsx^Uq « rho- 
tiatUliit^yt ii U^f 3* vud of « uii*U 



l*Cfla«Wiy. iMrlhCfmorCi cleavage of Uils 
type of pniba is not required tci acnievo. 
5omc redaction In quenching. 01i K onu- 
dcoiidc with a rcponcc dye on the S' 
and nnd i quencher dye. or the 3' end 
ttxhfhii a mucn UlgTier reporter fluoroi- 
cvuew whan dnuOlc-siramlGd as com- 
pered with ^nMle-strondco. Tiii* should 
make u pcwslWc 10 use tni> trpc of dou- 
bls.Ubeled. proDc tor nomogencous dc- 
lection of nucleic acid iiybnumnoa 

' nCATC RIALS AMD METHODS 
Oligonucleotides 

Tible i shows ihl- nueleoddo Joqucnce 
0/ ollKonuclcotlcUi uAcd In 
suidy. Uiiker arm nucleotide (LAK) 
phoiuuorarntdlte was uhramed £/om 
OJcn Rcacofch. Tlvr standard DNA phos- 
phoxamldiics. 6<arbcorynuoresccln (ft- 
PAM) phospnnramidtte, o-carboxytet* 
wmethYtrhodaniinc sucdilimitlyl esicr 
(TAMfiA NMS e>ur) r and iDOsphalmK 
fa? aUechlng a 3'-WocWnB phnspnatc, 
wrre oDUincd hum PnrVlu-Eliner, Ap- 
pUeu aiosrstcms UhiaSon. Oligonucle- 
otide synchests ^as pcc/oiined using on 
AB1 model JPatiNa cyntncMtet tAppMcd 
Ul\wyiwrru). Primci and compiemtint 
wUivn-uelenctdes were purlfiCO using 
OH«u ruitneaUori Cuflrldxo (Applied 
Blo&yiteim). l>knjble-lal»t:tcd f irnlnrs ercre 
..ymhrsbra Willi 0-I»AM« labeled phov 

wlMv.^uiittUealtltf.S 1 w»0, 1 AN KJlladiqC 

of 11: uf tbvTs In the iocptenca, and rtms- 
phalmfc ai the 3' end, TolWuig d^ 
pfiiiriikiii aim cuiitftul pnxlpltBtiuii, 
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FiGUAl 1 Dlef/am of 5' nucleate away. Stepwise rcpicx:mRrttj« C( Vbe 5' -* 3' nucUfolyric ac- 
tivity of rag UNA polymerase acttnit ua a fl\iorui;cnk wrtic durjiiK «»k* eatonriuu plioK vt WV. 



Nft43lCdtt>OUftlC buffer (pU 9.0) ni 
room temperature. Uurcacicd dye was 
«I'D,1Q ScuW 
Hex column. PbwNy, I he double-lofeckd 
prubc "»nro3 purified ('Y pr* p«f nti^c hi^K- 
pcrforrnance liquid chromatography 
(IJPUJJ u.Ung an Aouaporc l„ 2211x^6- 
mm column ^tth ?-^m partieU sie<>« The 
column wa* developed vrhh a 24*m)n 
linear gradient of 6~2tffe occtonUftic m 
0,1 m TJLAA (tri«,*thy)ainitu« aecrtate). 
VrobejL ar« named py designating tha *e- 
<\ucncc (coi\\ Toblc 1 and the poatiioiierf 
the IAN~T<\MRA roolcTY, l^r e*omple< 
probe Al-7 has sequence At vrith LAN- 
TAMRA ar nucleotide position 7 from thr. 
.V «n<t 



P<ft StjIchij 

All I'CH ajnpl/ficaUom v/err performed 
in tbc rcikin-Elraui CcneAmp PIIH Ny*- 
teui ytUO u«in5 SUV! rcacUonc that con- 
tained 10 mM Trt.vHO <p» {J/J), 50 him 

Xgi, 200 p-m dA'ir . 200 dcri', Sux> h-m 

dCVTP. -400 dUTP, 0.5 unit oi Amper- 
age uracil N-elvcoavLaae fPcrkln-iamer), 



jrene (nudcolldcj 2141-243S in in* se- 
quence o/ Nakallinfl-UUflKi ct »nu 
«iiipliLltrd uj'ii^ (nimeo AW' add AlU* 
CTafeW 1), which a« modUkd tfljhlly 
from tbo>e o/ du DrcuU et nl,™ ActJn wn- 
pUf lea Hon rcactlbna comalncd 4 
w^c 20 rijj ot human genomic J5NA, 
50 nu Al or. A3 ptobe, and JU0 nu each 



TABLE 1 Sequences of Oligonucleotide* 



primor. Th« Iharinal x^lnwn SO^t: 
mln^, 95^ (10 mln), 40 cycle* of OSV. 
(20 atC3, 60-C (1 iiiln), «nd hp|d c< ?a»0, 
A «egnii;iil wai MIipHflcd troui U 

pJasmld lltat of » »cf{mfcni ni \ 

UNA (n l!C icotlde8 32,??0-a?»7->7) in- 
serted ir. the Smfli IIW o/ vector pVCl 19. 
'lli«6c r^vlluiih uilHuUUHl U ium 
M fi <:l 2< 1 n<< o^J^"n4d DMA, 50 n>4 FZor 
P5 probe, 200 «m pflmotf" VHP/ and 200 
<u»< pjbnoi The thermal 4c«Jmrn 

Vvas SO"C C2 min3, W-c Oo mm), 2^ «;y- 
ciel Pi (20 »cc), sre 0 min), nntl 
hold a\ ?l*C. 



HXixnrt&cencT D«t action 

Vor ctioh amplification roue lion, *J 4tLui 
allquol of * sample iraoslerrcd to an 
individual whW of awhile, Stf-woll micro, 
tlur platd (PttWu-UVmcd. nuorcxecrtCt 
waa moaaurod on th£ PcrUln-tJmcj 'W<\- 
Mai: L.S'.^fUl Jry»Um i ^hlcll conoUu c»l A 
lumincsconco eptK^romctcr wiih pl*»« 
reader aaietnbly, a 4B.1-nm cxcttaUon fll« 
tar. And a MA-nm amlitlon 'H'.tr. Pxcho- 
Hon was. ai 48R nin u«ng » 5-nm $Ut 
widih. tmfcslon waj mcwureci ai 518 
nm for o-VAM (ihr. reporter or U v«luc) 
and «U nm for TAU1U (tn v CjUcnchcr or 
Q vkU) v>in ft & IO-nm «IU v/ldth. Td 

determine ihc lm.ica>c u% »^|.«»U;i wmU- 
ilon Ihai 1.1 caused by eJeavnge o( iUc 
probd dtjrli%^ PCU. three n(nrn*^r«ationa 
aic applied to the iaw *nu«j»itf*n dju«, 
Hrat, emtaalon liucnahy of a buff*« hUnk 
Is subicac\cd IVjx each ^avclcnglli. ate- 
end. emission Inten^ry of the icpoiicrls 



Kama 



Kl 1<>. 
W 
11C 
PS 

AAV 
Al 

A tf. : ;.> 
A5C^ 



uninirt 

r pilxneT 

complement 

probe 

Ctfinploiifai*^ 

primer 
pnnicr 

-.pnjljc 
■>: coni|j]aif)orll 

vuoipl«tn«ut 



ACXCA<^ClOAA<rKlArCACCACrc 

A-rcrrpjcorrcCOOCiv^Kn-i mtjr: 
CTACixjcrrc goacx)atca<:taat<io<uiO 

COClA'ITrGClXKn-AIVXAlXJu^AACCATp 
TITATrXTTTCTr^AeAl At^OCAAA l OCC 
TC>CCCAO\a OTOCCCATCTACCJA 

AJU CCCiCCXXXAXC 0CATI CCf^COTV 

At^t^^^iiiiA'n^it^ixstJCJUi^^^^^'ACi 
corrn rtcAcrrcx aocaaoaoAxv 

rr>Tr^CrrOCTT/lAAQTOCAO0OCCU 0 



Tor eicnrgiuoriuacutlUe w;^ In Utb atudtn the n«ci<lc »dd^«Kt b 
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2<M 
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M 
• an 

.TO- 

nee 

nb 
ua- 

Mil 
>IM 

uir) 
rar 
ro 
1U- 

l*K 
XLS 

Ha. 
«ec- 



TO 



A1-3 
A1-14 

A1-10 
A1-22 
A1-3C 



C18 rtfn 

la mp. < Wmff. 



ftCV 



M.012.1 0S.9»(U> 69*- X« O.60 2 O.0£ MO-iOOC 

8».oi.^ »6.i a :i> toe.6H.fi> hm^m ..^••ft ...-W* 0,0 rLMsa,R 

U7.o*Ai> m io*.»±s.j oxw ^ l u*o»^i*io.u imam 

1QTC*1>9 iH »,Td 7t»^<^ n.O*OX XC7»0,.DS S.MJ.C.U 3.133 CIS 

4aa.titt«a.« iaao±«.Q mxio.o C26i0.oo icsxc-.n cr/iO.u 

I602ja3 ltt.l*M VL'iU 1.1k M1*W» ******* 



A1«3 

AH 4 
Ai-<9 

A1-33 



* M«dt Uairuh Lnd MM**.- k jt *mpimcatiqn, containing jhc In- 
*«udwob«t w*f e p«fcurv«d. the flunraseaice emission was neiwtd « 518 Mid 
teS vsUi« .* STc m»0*1 a* for ax ruction, nin^out addtrftcmphtc (n o icmp. 
3? " Milan* run with implm ( ■ temp.), iny HCJ -otto was calculated for web Indrndvil 
action anU^cca K u4l to give tiu «*f>o*le<L RQ" ttq* vdutA. - 



oiviacU by the anbslon intvualty of Uw 
quencher to give an fcQ nuln &>r cacb 
paction lube, TUU normalizes tor -wrll- 
to-wdl variations -in probe ronnencrx- 
itnn and fluorescence measurement. Pl- 

* nany. aw4 calcinated oy Mitm acting 
the kQ value of the no-templ'du; cuOlrOl 
(R<3T) from the HQ valuu fur ti «? wirt* 

• pictc rcccnon including template 
(RQ, ' )■ . 

RWUtTS 

A senrs of prcbei with increasing dis- 
tances oerwecn me nuortuccm itvunci 
an J rhodamlno quQnchai wern tested tu 
investigate th» minimMA and rn^ 11 !^^ 
spanng mat vrouid sive an ncceprabtt 
performance m the 5« nuclease vd\ w 
sjy, Toefie prob« hybridize w a tar^a 



Ae^uen^c l« Uic hunian p-aa»n genr.. 
Fl^uic 2 ahoy* ihc icsiiiH of *n c»:|«3Ti« 
ineni In wltlch lite*: probes **rt In- 
ciudc^lii >r^, s Ou^ft^Uaed a segment 
of tlii; (J^tf ir ^Vi*-^*n : U*iiiUi3 Ihc UlgCt 
3v^\»oiwr. IVnfufiiiUHfli lii 5' rtWM 
clea3e PCIl ***<\y la twHiHwrol liy 
m^^nUurtc of AKO^ which Ij q meaniir 
Of the incr«b»'« tn reporter nuureMVivT 
u»u«cd by PCR »mpJl<i«»*»on e»f the 
probe wr^ct; r™b+ Ima « AUCJ v*1*j* 
ioa\ A5 cloae. to ixrti; Ui41ei>tlng thai ihe 
probe was na\ cleaved Appnrcubly ilw 
trig the aunplUleiiKMi r*«ci««i. Tlib iug- 

KC»U thfcl TfHll ^u-n^ivex oy« q« Ut« 
secuild flUCleOlUlC from the S 1 t:nii f tberc 
ii liuurncleiit iuuim -IW Tav polyincfajt 
tu vleoye effU.1%aUy between the rcpottcr 
aad queneliei. The other five prohw eoc- 
hlhllcd ecmvsrabio Ak<i valu»t Uwl 

.;■}■. >A /-.' V;;,-.'.^;-. ! --+«! 



clcfirly different from rcro. 'n»w, «1l flvo 
probca tff hernR elc(rvrd tftinni; K:H am- 
lillAeMtlun icwjUinftiu b alinUdl Uiercaw: 
ill JCpvjHcj' Huu*>»>uuiv;c. )l fdiuwdd Ue 

noted thnl complete digestion of o probi 
pxciaucos a much larger Increase in ier 
mortar Ouoxttccncc umi Uiat observed 
1a W^v*c 2 (dau not ehowo). Thus, even 
in rvactlvm ^herc « in pi if "all un occur*, 
mc mflionty probe molcculce reinulu 
untiicavcd. K is nuinW rof im« reason 
lhal lUe flxjofwccncc Intcmhy oX the 
quencher dye TAMHA elionfic< lliile with 
ampllftcailun nr the iarge.i t ThU la what 
.>llaw» os to u*e rhc iOX-nm nui»iesccn<r. 
fca<ilng * normilUMtloA factor. 

Thr TfiiijnintH'- nf HQ* H^nrf* 
mninJy on the gucnchiti|« vfrlttfcncv in- 
ner«n( in roe spwanc .itrucmre ni the 
probb *nd the purity ol the ollgiuiude 
utide. llws, th«i Urgct UQ' valu»* indi- 
cate that probes Al-H AM9, and 
a\-26 probably have rcductd quenching 
2s compared witn A1.^. ami. me deRrw: 
of <juc^ehlng U wfficLent to detect ,a 
hl^Kly Mf^iHcant ln««roMmt In rcpoftur 
flu«ire>eeiice *»K«n «?«iclt iWww probe* 
ia eleavtd during PCR, 

Tn further investigate the ability of 
TAMUA on tne V <-na to qu«meJi CPAM 
on U2C 3' end. ihrcc nddillonfll pain ot 
probci were tested In the 5' nuclease 
PCR posay. Foi #ach pAir, ooc probe lias 
TAMJIA ottoehcii lo «n int«r'v»l nueli> 
ullUc and U'»t olUei hw. TAWIU *LUdicd 
to the y end nucleotide. The tcaulu are 
ilxowu la I'ablc 55. Y'OI all Mirer, sets, the 
prob« vit«h the 'i' quencher exhibits a 
ARQ vnlxie lltal in considerably liifil^' 
thaii fvr Ok probe with \hc IrHcm*) 
quencher. The HQ" value) sngSC5l lhal 
<Ufr«*nc«e U qu^nehi t nfj «rc not « 
oi tboac observed >-»th >om^ pf Oi* Al 
probci. T1«ae roulia dcnaouatrale thot a 
quencher dye on the 3' end of oligo- 
nucleotide ran quench efficiently the 



TABLE Z K«U. C Of V m*m *™ Com^s ^ ~M TAMBA f >n mt<r n „, or ^ NuoW.U, c 



Sift ton 



mn 





Prob« 


no Tempi 


+ temp. 






7l.\ t 2.9 


B4JZ3.7 
036.5x11.1 


the 


1^7 




3H4.U* 34.1 

d 14-1 


»cc 
»u- 


1-5-1.0 


77 J = 
*1,0 x 3.Z 





+ lemp. 



1VQ 



116.2 

103.1 X C.< 

.1*0.7 rftii 
Kai.ti t a.T 



. 00.2 J. 3.B. 
* 10.2 

lia,7 = 

V5J1 t»J 

9i.7 = e.3 



0,86 A U.W 
0^9 1 0.02 

aw ± n -a* 

0.B9 * 0/)t 
CLA3 ± 0.0X 



0,71 x 0.0^ 
2.61 - 0.(35 

,vto » aid 
a/^ r (Vio 

? 0.06 



vat* i» o.n^ 

2.40 0.1 6 
3.68 t 0.10 

1.66 x 0.O8 
2.891 0.1 :i 



ui./ l / /iuuo xo:ou f AA ODU 0^4 uooo 
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flu«TWK*rncr of a mpnrUr ilye on Ihe 5' 
mid. Thy Owyruo of quenching li iuffi- 
cical fo- Uii* tyyv of oligonucleotide 10 
be uird as a probe In the V nuclease PCR 
a*say. 

To t«t tho hypothetic thai cp#o«rhing 
by a 2' TAUHA depend a on the fWUlity 
oJ the oligonucleotide, fluorescence ma 
mcaMjjml fur probes in the single- 

SUinded flild duublw *trandt««l state*. To- 
hi a % < rep©*U th« fluurwcenva ohcorv^ 
at SI H and $52 Jim. The reJatlvc derrte 
of quenching la assessed 1 by calculating 
thr. RQ utiti. Wir probes with tamra 
MO nucleotides ffom the S* *jiJ, tliurv 
Is little difference In the JIQ value* when 
curnpartnc siu£le*uan<U:<l with doucle- 
stranded oligonucleotides. The results, 
/or pmhef with 17VMRA a i the J' vnd are 
much different For thou* probe*, hy- 
bridisation to q complementary strand 
causes a dramatic Increase In HQ. We 
propose ihar rh(.v toss of quenching js 
caused by rne rigid structure o* double*. 
Slrknded UNA, which pr*v«nr* the 5' 
and 3' ends from bclnx In proximity. 

When TAMra is placed toward toe .V 
(mil. thOM U 1 markcid Mg* 1 effect on 
ou«nchtng. Figure 3 ahows a plot of ob- 
served RQ values fur the Al series of 
probe* ax 4 function of My aH conccntra- 
tlon. Wltn TAMi£A attached near the S r 
end (prnha A 1-2 or A 1-7), the UQualue M 
0 no* Wy^" Is only slight ly higher than 
RQ at 10 m M Mg* * . Vat probes AM9, 
Al-22, and Al-Zfi. the RQ values ar DmM 
Mg*« are very hinh 4 Indicating a much 



wuJuocd quenching eCHcloncy. Tor eneb 

of -tW jarobei, lh«i« U tt marked do- 
crvaac in Kl^ a I I 171m Mf 3 1 follwcd* by 
u ^rududl decline as 1 he 1 tuuecu- 
trution inrreatei to IO mu. 1'iubv Al-14 
showi an intermediate JtQ va| u </ at o m>- 

M$ 7 * w;ih 1. ^\iuol decline at Mayicr 
M^' caiicenUtttums, in a low-salt en- 
vironment with 11(1^^ - present a »rv 

gU«ile**uUr<t ©JlgonticiuOlJue would bo 
tstptrteU tv «dopl «i4 vAintU^I von for- 
inaiton because of cketiortailc repul- 
t -slcin '^tlisii °^ Mg 4 * Ions nets hi 
i)Uc] l1 .the^n^aUvc diargc of vnc phox- 
{itulti backbone so> chat the ougouucJ*- 
otide can adopt confer/nations where 
Ac .v end is close to the y end. iiiere- 
fore, the observed Mg 2 * cflecis suppnn 
the notion xhai qucndiing of a S' re- 
porter djv by TAMRA at or near the 3* 
end depends ess tbc flexibility uf Uie o!l- 
gonucleorldc. 

DISCUSSION 

The strlWnc flndhifi of this siudy is that 
J\Y^niS,-.llTie/ rtiodamine ciyc TAMKA, 
pUeed at any potiiion Ui «n olj^gnuclc- 
otidCj eso quench the fluorescent emis- 
sion nf a fluorescein (0-i ; AJvl> plarjpU at 
•ihe: # ehjCl;^hls laipu« inai a smfjH-- 
itiri r^cii/ ^ dcmbl i.labof <-d ol i^cnue le- 
ptidf muft h* ablo to adopt confonm^- 
tions Vrhco* tha TAMRA Is el«ae to rhr. 6' 
ejicl, !l should he noied that Uie decay of 
&1<AM hi Hie excited siatc requires u cer- 
tain Amount of hmc. Thztafor*, whdl 



TABLt 5 Compori>on of PlunrcAC-v»<c f<iiUa^u^ yi^liielc%iTr«ndcd and 
Daublo»lrondcd RuorogcoJc Prubc* 



50/ Tim 



RQ 





•1 


d» 


01 




M 


da 


Al-> 








133.1 Jt 




1I.-V3 


AI*2A 


a\.4i. 






93 66 






A .*i • f% 












<UB 


A.'^i4 


3ans 






mo* : 


u*s 




n.i 


is.02 




A4,o.) 


.121.00 


0,6« 


0.SS 


V2*17 


aojio 


aao.4^ 




: At.Ti.r 


0.^1 


s.as 


I'S-IC 


a7;u 


144.&5 


61.$»S 


16slS4 


0.4« 


O.K7 




33.^< 


4ca.2C 




;'. ; i'oi;4i - \. 


0.46 





C15) Slnglc-jwandrd. 'I'hr fluurf kwjjie zmiiriom j\ 41Q ui »182 ruini for solumins rontfllnlnx d filial 
vtinccntrotlon of 50 jjm iiullcated probe. 10 mM Tiei-lfCI (pll '93), 50 rnw XQ. nod 30 inn MgOl*. 
(di) Dtiublc-Ntmndrd. Thr, ^nlurlcins combined, 1.n,flddiijO(V..100 nM A1C fur pruhr< At-7 and 

a wxi, 100 nxt A3c far probes A3-6 ana a>24, ioojim l*lt: for iimbes 17^7 ?na rE-?J. oc 100 nw 

C /ur vrcibi^ w-io an<s l«4-a». ncforc mc *o<ii\io»i of Mic\iiirJ 1*4 or edCU iHmUc vrzt nuanrU 



mattara Tor cjucacblu^ u hcji the 'av/cra^c 
di(tanea between 0-1-AM and TAMIvN 
tour, mthci, how close VaMM can fer lc 
0 « ham rtuiiog Uic htfcome of inc o.hAM 
4XCUid uatc. As long -* Uic Uway time oJ 
tJxe exciud state relatively lonp; <vjn- 
parco with Ihe molecular inoUons of the 
oligonucleotide, qucncnln^ can occur. 
Thus, we propose tuat 1 AMRA at t)ju 3' 
end, oar any other yoMou, can cjueiuh 
e-fAM at the V ond because f AMBA Js Jn 
proximity h> fi«VAM often enough to br 
able to accept energy Transfer from an 
excited 

Dq tails of The (Juoi6sc^nc« measure- 
racnb remain puzriln^ Tox example, Ts- 
ble^ shuws that hyortalxation ofpiobcs 
Al-26, A3J4. and l J ^2H to their comple- 
mentary sirari<k not only causes a tofflc 
increiiae in 6-PAM tluoteicence at 518 
11m but also causes a modest Increase in 
i*AMUA ftuoresrxria^ nin. U 

'j amu a is boing excltwi by <«necgy trans- 
fer from quenched C^VaM, then loss of 
CjuenvhinK atlrlbutable to hybrldlsaUon 
should cause a decceaxe in the fluorcv 
ctmcK emission or tamra. "I7ie fact that 
the fluorescence emission of TAMRA In- 
creases indicates that the situation U 
mot p complex. I^r example, we have an- 
ecdotal evidence thut no oases of rhe 
oli^onuciaotida, especially ti f quench 
the fluorcsevnee of botlt and 
TAMHA In some decree. When double- 
jtcaitded. basc-paLfi/uJ inay reduce the 
abntfy at the oases to quench The prl- 

uivry factor causing t)»w o^Utnching of 
0-i*AM in an intact pcobc i» the TAWRA 

dye- KviUcnce fox Uie imponana ol 
TAWIU it ihaL 0 FAM nutiKM^nce 
remain* reUUvety unciian^ed v'ficii 
^iob«s WmfiftU ouly wtch 6-l'A*M are used 
in the S 1 iiucImsc 1'CR a»«ay (data not 

sho^u/u. secondary ctrcaon of fluorc?' 
o?nee» both before and aAci deava^c cf 
tha protx*, need to ho explored further. 

He<cardlcs9 of the physical mocha* 
nl,im, rhe relative independence of posi- 
tion and quenching greatly slnipllflea 
in* design of probes for too S' nuclease 
I'CR aa.iay« Thore are three main factors 
IhaJ dcienmlnc the pctfocmance of a 
double-labeled fluorescent probe In tlic 

nuclesse IKLTL ottay.Thc first factor 1» 
the decree of quenching oliKtrved In the 
intact probe. This la choractcrired by th<* 
value of TxQ' , which Is the ratio of re- 
porter to quencher fluorescent cmls 
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mM Mg 

t*u hci fan fcjf SO fiiM KC1. bT^n vary^nc amount C°-10 m*) orMjcU* ™ c»icv«w« «. * 
J^llS^tSl? »v MAtim b.imMtr) ure plolN «. vfgCl a concentrate (mM 

M n }. '"i* W-y <up//w /*W> >h»w* Hie |nutft» vuiuliietl. 



dye? Used, npociofi between reporter *nd 
quencher dyes, nucleotide scqocnec 
content cifccui, preftcn.ee or ftlxuctutc or 
ulhci fftclv* tl»l reduce fliiuViUly «r 
the oligonucleotide. ±nA purity of the 
probe. The second facior 1* ibv. efficiency 

• t.r liyhmlfealiUil, Which depend* Oil 
probe T mr ?«**cnc« of ieeon« *ry slruC* 
turc In probe or template, annealing 

• temperature, and other reaction condi- 
ilons. The thUrt fociov IS Uic efficiency at 

• wiilch 7«g DNA polyrotrwe dewes the 
bound pxobc bclvrccii the /cpoitct and 
quencuer dyes. This deava^e la depen- 
dent on sequence comptcmcnUrUy be- 
tween piobe and template hs jhavjn by 
vlic obociv6tloii lhal mismatches in the 
teenicnt between reporter and quencftwr 
dyes firaattwMy cctlucc \Ue acuvoKV uf 
|ncUc. w> 

'Hie rise in RQ* values for the A3 so* 
nci of probe* secern* to indicate that the 
rtefcrec of qucnchlnc rcduwnJ aouw- 
wnai as ow quencner is jiidcud lowaixj 
the 3' witt Iho apprent munch- 
ing Is observed fof ptobe Al-19 Fift. 
3) rimet rhan for Die probe where The 
TAMRA U a\ VI* ^» «nd (Al-26). iTllS W 
-luidtrstondflble, m the conCormaiion of 
the 3' end position wcmVd be expceicri eo 
ba lees resinned r^au the oonlurm*tion 
of an lnurrvu po»won. in effetx « 
(juencher at Iht 5' «nd Is Ireei to adopi 
ccnlormflUons close lo ihc S* reporter 
Uy« than Is on internolly plucwl 



pfol»^, ilte Interpretation of RCt values 
ij lea* cl«ar<uL The A3 prolics ihoW the 
same trend w Al, itllh ihc 3' TAMRA 
piube bavlii^n Urjjer JIQ.* Uian Ow 
ivnai TAMAAijpefae. l^or the ^ p«U, 
WotK pniliw haW *h«ul *h« same KQ" 
valut. For ihc PS probet, ihc RQ W Unc 
X probe h lew ihon fr*# ihc lnic/.»«Hy 
IfloMeo piobc. Another (actor ttvn may 
wrplnin ipmc oi the oh.<rrved varloHcn 
thai pu>lry •■ffoda;Hi» UQ - value. 
Oiou^li all prober or* HVI.C yuiitimdt «l 
imall amount of cohlaminotion with 
uncpioneh od reporter can have a large ei- 
fccionHQ - 

fcel cm degree or quenching the posl- 
tion. uf Uic qucnilier ap|«arc/«tly \mu 
lirtvc 6 Leu^c effect cm the cJTicieiiry of 
p^bc.c^w!||^..5Fb» -Ki«a4 ^r*»tic «floci i« 
OUSUrvcO Al-2, wJaePfe pUce- 

mcnl nVin^^KA on the jvx\«w' nu- 
Vlrullde irduuei ihc cfflclcj>cy of cleav- 
age to almost *oco. For ihe AS, 1% and VS 
phlhe>; ^Ur^jii roiayii Kn^l^f Uta 3' 
TAM1CA prohM as conipaied with the in- 
tiunul TAMRA prnhts. Tll\S 15 explained 
mosi costly hy aammlup tu«i p*obti 
Willi TAMRA aV thc 3' end are more \\My 
to t>c cleave*! lietwecu lepwlci and 

quench*t than arc prohes with taMKA 

Oltfl'ched mtcnwTry. Vor the A J prober 
the ekovupo atpcitmey ol prone 
mus^already he qutie high, ax A^Q dect 
nor tfl£rewr v - wriein the quenclier w 

i .t.!l./««i ^iArM« TA rH» end. Thin illOJ- 



iraln ihc impotti"^ Ahlr> |J) 

use probes with » q«*««hor on ihc x 
end in tlic 5' nuclease f*C« uiuay. In thin 
M iay, an inercaofcin tnc intensity of re- 
porter fluorescence is ob*wved only 
whiill Hie probe U ck-anrd bvlwci'Ji Hie 
leportar and quencher 6ya Dy oincinK 
iliu iupwUj# ^»iu qaeiidn-M dyuk un iho 
oppoiUe «nd» at oHf/>nudecti<le 
j>robc. any cla*v»«« thai occurs will be 
lUnccteil. When Uic Quencher U utUietunl 
lu an ULUiriuil fiucleuUUo, BOmelliwo Urn 
juviU- wucV. -d! CA^-7) aiitl "Ihcr Uiilea 
nol so wtU (A5-6). Tb<: raladvaly poor 
performance of probe A3-6 prewmably 
mcaor tho piobc U being cleaved 3' lo 
ihc cuenchcc ia»ha* than ootween vhc 
rppnrirr «nd quencher. Ther^orif. the 
best ctiancc of havlni: o prohc that reli- 
ably detects accuniinatloii oj ?CH prod- 
uci in the .S* nuclease t'CK assay Is to "it 
a prone with rhe reporter ond quonehor 
ilye* un opposite end*. 

Placing ibe quencher dye cn lh« 3 1 
otid may alao provide a SlXtfW boncflt In 
terms p< hyWndl«itl«n eflioency. The 
prcjeucx oi a qucttchcr atucbed to an 
Inlem&l nudeolldc intRhl tie cyspeded tu 
dUrvipl Uwn-painna afiil reduce the T rtl 
of a prohc In foci, a ITt'-Ml rudnUiuii 
In hns been observed tw^. p f obci 
wiUi tjilttiidlly atuidied TAMKA». ,,fl Th* 
dlifupti^c cffecl would be minlrokcd by 
ploclng the quencher al U\c 3' end. Thus, 
pnibra v^irh qucnvJtcn might ejthlbll 
Mlfthtly hlRhcr hrbridir^tiun effldcncia* _ 
Uwo piulaes wlOi t/iteriiiJ qycjiclien. 

Tha combination of increased deav. 
age and hyloHdiaallr>n clfielaticiei means 
ttiat probes nilh 3' quenchers probably 
will be mote tolerant of niismoleiio be- 
iwccn proV and target as compared 
r*ilh ^nlen^ally labtlad pfoliat. Tliis tol- 
erance of enumaiehej can be edvania- 
geoua, as when trying to use a jingle 
probe to detect PC.R^iplificd product 
fruit mtiKptt^ uftlilTcfCiM spCClCS. AlSO, U 
mean's that eleavagv of pnjbe durl PCR 
U Icis aciidHw U> olt-i-ution* 
acalln« temperahsrr or other feacUou 
condliloni. The one! appHuatlcm where 
tolerance of mUmatcru* may De .1 disad- 
vantage. i.i for allelic discrimination. liJe 
ct at °» demonsUatcd that allele-iijHiclUC 
proh4f wiirc ClCftved herween reporLcr 
and qu<mtf>cr onry.when hybrlclizca to z 
perfectly complementary rargcl. This al- 
lowxl ihein to alstlnKui«;U tJie normal 
rromafi cystic nhraiia ^Iclc from ^ th« 
AFS0B muant. Their probea had TAMkA 
^ruchad to tnc seventh nucicoUdc ftotn 
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dye* uied, ipaccag bciwrcn reporter and 
quencher dy«s. nudftoUoo swjuwice 
rm>*xt effect presence ol smictuie or 
other actors that reduce flexirnuy of 
lha oUgoxuictcottnc, and purity of th« 
pcobe. The coc*ind factor IS IUC efficiency 
o( hybrialwtion, which depends on 
probe presence o< secondly «f uc ^ 
turo in p«h< or template, annealing 
icmnerahirt, and oihar reaction condi- 
tions. The third factor is the QTiicioncy ac 
vniurn *W UNA polymerase deavci in* 
bound probe between the reporter ana 
quencher dy«. ihis cleavap* Is depen- 
dent on sequence compUanfcnutivy oe- 
rwrcn probe and templar* » shown by 
the obamwton that mismatches in t*e 
leamant between reporter and quencher 
dyes drastically reduce till citrons of 

P«>ht, ft) , v M «w 

The rUw. In KQ v*hjc3 for iha Al «> 
ri<* of probes seerrw to indicate that the 
<tauta of quenching is reduced soaw- 
whet as the quencher is placed toward 
tnc .T rmd. The lowest apparent quench- 
in* is observed (or probe AM* (sec hg 
3) xa«her than for the proo* where the 
T/vMHAisatthe 3' cx.d(M-26).W«u 
.undcrsianUiiMc. as inc confozmaiton ui 
the S end position wuuld be expected 10 
be lew wtrlctod than the conformation 
of an internal position. In effect, a 
quencher at the 3' end is froor 10 artiDpt 
conformations close to the S' reporter 
dye than Is an internally placed 



pivl**, Oil! liit«rjJt€iaUnn of lUi voVica 
is less dear-cut. The A* prober show the 
5-me: trend a, Al, with th< S' TAXWV 
prohe. havine a larger RQ~ *™ T, ai ;i ,u T 
i<vrnal TAMlWv pwVc- :Vin H« r2 
botii pioboi hava about t»t same RQ 
«Mtir. Fnr the M pr«Kj ( U.c HQ" for the 
y nrob« ts less fv. U»< mltMwny 
libeled probe. Another factor that may 
c^Jain some of the ohservr^ voriaHon \s 

Sit purity atfccta *** A h \ 
thcnish ali prober ate ; HPLC P urllU^ 

imaU amount of contamination with 

\m<jU4rnclied reporter can have a lersc 

feci on KQ „ 11# 

AUhOUgh Chcxc may 
feet nn decree of quenchmg, ^ 
Uon of the quwhrr aPl«itnuy can 
have p large ctlcci on. the efficiency of 
probe clc^^o, The most drastic Cecils 
observed with p«*i Al-2. wher* 

jmmmkum Mn^cyvi aw* 

qcctO klihoKlwvq, |or th* A3 r P2.imd^ 
prober ARQiu much gr^auif W rhc. 
TAMKA prohP,vas compurtd with me m- 
tennui TAMM probes. This Ly explained 

wuh TATwiRA «IU* 3« end ata mort 11 kcty 

nucnchcT thaa ate pmbtfs ^vlih 1A ^^ 
flttadud loiexnally.:?or the Al proben, 
the dcaviu^ effideocy of probe * w 

m^i» 

noi increase Whc n tlic " 
ulac^ clos^ to.ihe 3» end. Th« UIuk- 
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tiattos Uic lmpotSAAce ol boi«ifi able *n 
use probes with a <ju«nch« 3 # 
end in the V nuclease I'UM m»Y- In UxIb 
away, an in*#M«« ^h 0 mten^ty r* 
porter /luor»»e#nco Is oh«ifvad «oly 
Irhcn the probe U cieavod between mc 
repartcr *nd quencher dye*. ^ pW^ 
tho teportet and <ju«nchcc dyo on tiw 
opp<xlUi end* of k:i oUgonudcoUdc 
.»ruh«. *4«iw^ tliju WW«rt will \*. 
flexed, Whe-n the quanch*r u ittarhad 
io UUciiiaJ tuidwtMtf, .oinodinvt 0>a 
rirobe wArk* well and other titw» 

not 4to wi-ll H»« relatively ixmr 

pcifnr«^ec of p'ol^i AZ~4 f retumably 
maans ttie probe is belrm cleaned ^ to 
tha GUenehcr rather th*n herween the 

chance of b*vln« * probe that cdl- 
ably detects accumulatiun of PCR prod- 
uct in the V nuclease PCM a*8y U to 
a proUri wtth itv icpbrici and quencher 
dyes on opposite enos. 

rtaeln? th« quenche, dye on. Ute 3 
end may aUo provide . .Ugh bancfl in 
(WT nA of hybridization clllclene>-. I ha 
presencts of a quwichar ansrhed to an 
h«crn«l P.uwW.0e .nlAhL 
dUruot b^sc-pairinc an<l reduce the 7, n 
o£ a pro**, in f^ * ?<> vc t eduction 

•rahWady aUaeh^TAUlU^ 1 hb 
dumptivc effect would be mlnlmiatt by 
placing the quencher ai tht:« 4 «^™«; 
probct wim quenchcTs m^h cnhlbll 
illifihtly higher hyhrldizarton cffidcnclrs 
than probes with internal qunncho.rs 

The cambtnaUfcn of tnaeased desv. 
afcc and nyftrtdi»Hoi»cffld«ttcl« maanK 
that Pl0h« with 3* qvwehcrs. probably 
Sll be mow tolerant ot mismatches be- 
tween probe and r^t as compared 
with internally labeled probes. TO* t* 
erancc of mismatches wn be odv ^* 
rcoti), as whan trying to u« a slngle> 
^obe'tn detect POUampiif^ pmdum 
from samples of different »W «j£ 
m wnv that dcavacc of proU* during VOL 
u | WK ncndtivp to iillCTttlcins In an- 
neallnu tamp^ature or oA » "Jg 
condltltms. The one appliuUon where 
tolarancfi of mismatches may be a dlsad- 
vanta« U for allelic discrimination. Lcc 
lS"<lcnumauaied lhat allcUsspedfic 
proow were elenved between reporter 
In* Huencher only when hybad,«d to a 
.^focuy complrmcntary tancci. This 
lowad them to Ulstlnyotsh the normal 
human cyrtfc fibrosis ^C*™^ 
MSC8 mutant. Tlteff probes hed jTAMKA 
aciac^ed to the seventh nucleotide from 
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di« V end an<l w W <<<Ki$n°d to that any 
mismatches were berweun the roportot 
and lyncher. IncreaMnx the distance 
feet*-*** w»p*»rt«r <juor»ch«* WCWild 

lc»acia Ihfc dluupllw effect <if ml*- 
marches and allow eWava£« of the pro©* 
cm the lnCOUCCC targgt, Thu*. probes 
ti»Uh a qvicnclwr atttehed <o on internal 
nucleotide nwy stlH bo ucoful fa. allelic 
Hlsrriff»in*tlo*\. 

Hi llLU Study 1o« of quonchlng vpon 
hybridation will USCd 10 lhcrv? that 
quenehing l>y a 3' TAV4HA U dependent 
un tlw fle*tt;Wry ot a aintfa-wrandid oli- 
gonucleotide! The increase Us reporter 
JluoifttocnCf Intensity, TT30U*ti, could 
4I10 h« u*«d to d«ken\uno wnouier Uy- 
DridleMlon boa uccurrttd or not. TMjs. 
oligonucleotides Willi JNportsr and 
quenchet dyes attached at o|if»«aUe «n<b 
AhouUI alio be u«ral 85 hybridUatlon 
proUc*. 'i^ho ability to detect hybridiza- 
tion In real time meant f hat \IKic prut^x 
could be vstd ro measure hybridization 
kinetics. Also, triK type of probe could he 
used to develop Homogeneous rn/tirliU 

'.ration Mi«Y* dUgooi»4ie» or o*H<»r ap- 
plications. U*s*cil Ct *l. t,0) ttecrib* ju*t 
tills rypo of huTnojjcncoiis Assay where 
hybridization of a probe cause* on l«- 
truaia In fluofoccance cauaod by a losa of 
^uen<anln^. Mowcv«c, thty ^tiliiod a 
complex probe design thai /equit* add- 
ing nucleotides, to bot.h enda til the 
■probo aequinicii to form two imperfect 
Iwiiryiitt. Th« MiulU prcsontad haw 
vUniiuimww thut the simple addition of 
a r^poncr dye tt> one end of an oii&cmu» 
ricotldc aud a quencher dye lo inc. oU vet 
od gonweoc a fluavaganK proU; thai 
c*n detect hyb/ldlucti«»n or PCH amplifi- 
cation- 
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SIMULTANEOUS AMPLIFICATION AND DETECTION OF 
SPECIFIC DNA SEQUENCES 

Wts.ll HigucW, Carin Dolling, P. 6»» Wdd. »d Robot J»M 

1630. ,t Corr*flpo ruling aurhor. 



fttfe have enhanced the polymerase chain 
jWantian (PGR) such that specific DNA 
fcqueaces can be detected without open- 
n * the reaction tuhe. This enhancement 
squire* the addition of ethidiuxn bromide 
'(EtBr) to a PGR, Since the fluorescence of 
jBBr increases In the presence of double- 
Stranded (da) DNA an increase in fluores- 
cence in such a PGR indicate* a positive 
fiapiiflcattnn, which can be easily moni- 
ifiicd externally. In fact, amplification can 
ie.. continuously monitored in order to. 
(follow its pmgmw. The ability to simulta- 
Ktoiuly amplify specific DNA sequences 
f £ad datect the product of the arnphficatiou 
■both simplifies and improves PGR and 
/may facilitate its automation and more 
widespread use in the clinic or in other 
jqnaf&ons requiring high sample through- 

uhough the potential benefits ofPCR 1 co din- 
ical riugottcfes arc well known", it is sdl not 
widely uxed in this setdng. even chough K 15 
four yearc since ihermosrablc DNA polymer- 
fe* made PCK pracdeai. Some of (he reasons for 10 slow 
acceptance ire hicrh cost, Jact of aiuomadon of pre- and 
WCR proc«*iing steps, and false positive results from 
irtyovcr-conttminaucMi. fae first rwo points arc related 
labor is the largest contributor 10 crtft at vhc pres f ent 
age ok* PCR development Most current assays require, 
fae form of "downstream" processing OBCC thcrmocy- 
ix done in order to determine whether &e car get 
'A sequence vaj preianl and has amplified. These 
muA* DNA hYl^c^a-don 3 '". ae\ electrophoresis with or 
frfehouc use of traction di^raon 7 - 0 . H?LC*. or capillary 
^ r Wophorea» J0 . Thcac methods ar* Ubor-intcnse. have 
ft* throughput, and axe difficult to automate. The third 
jfcint is also doady related to downstream processing, 
-pte handling of the PCR product in these downstream 
^csacs iiKvesuo the tmaocea that amplified JJW A ydu. 
fread through chc typing lab, reaulon^ m a nslt of 



"carryo-er" false posiavee in eubwquenc lestmg . 

These downstream processus «iep* would be ettmt- 
nated If specific amplification and dewcaon of ampUned 
UNA took place simultaneously within an unopened re- 
action vesseC Assays in which such differed pxoce ices take 
place without die need w separate reaction components 
Lava been termed -homogeneoufT- No truly homoge- 
neous PCB. assay bas been demonstrated to ^tc. although 
pr Ofrress towards this end has been reported. Ghchab, et 
it 1 ", developed a *CR product dccccrion neheme ™»<r 
fluorc^eat prunen. that r*nd**d in a f|uor«cem PCR 
ptc^uet Aflclc-apecific primer*, with 
^ccait cap, -ere ««d to indicate the genotype of che 
DNA- Hoover, the unincorporated pnmtar. amt shU be 
rWcdmadc^nsiream proeeca in order to .vUualte the 
result Recontly. Holland, et iL^, davt^ped an |UO| <J& 
which the endugeno^ S' exonuclea^i assay of UNA 
nolymerase was exploited to den^a a laWfd obgonudeo- 
SeTrbbe, The^bc would only clem 
carion bad produced us complementary <*pefl~ . In 
ordfir to decea the cleavage products, bnw^jv^ a subs 
quent process Is again needed. 

Wc have developed a truly homoc«neour oiaay for PCX 
and PCR product deiecuon based upon the £t«Uy »n- 
creased 1 auorescence chat eihidiuio bronude and oche. 
DNA binding dyes .exhibit when they ere bound co ds- 
DNA 1 *- 46 . As outlined in I^urc I* a protocypic PCX. 
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RGWS 1 Principle 0/ simulcmoous ampUtodoa and dctcctoa of 
KrRprortua, Thccomponcna of a rtRcoriisir^ EiBt th^ orc 
auorcsccnc arc fotcd— Etflr iucif, EdV bound to adxer tsDNA or 
cUDNA. there in alar?e ftuoracenca «nh«irement wfccnRtBr U 
bauitd to DNA aad btridme u gready cnlianccd when DNA a 
double^tranded After suffidmt (n) cycle* of PCS. the s oej 
■increase in diDNA touIm ia edditu>«a £i3r biadiog. a*d a net 
in total fluorwecoce. 
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M 1 C^^^^^W^^ 
human, nuclear gcuc, r^fciow ttt/ini)- To« pr=«^= of 
S^noS - act SUe* of ampUO- 

canon. 



befoce xad ate*, or 



continuously during, chctmoc)'- 





•cquences. nrercpU=« odeof <h« five replicate 

fcpTfra: DNA «tt .^ c | u ^^^r«uS, 7 . (B) 
Ruorasce/tec 'W^JSEL ffl Je^H"^" HrP^ 



w» with pn»«, aw « > j5srSr i -SSt < 5' 

dsDNA containing tfaecrgec 3 ^"^ * y B dipeodine 
^ typically present. ^^^^JS7rfTS?A» » 

niicrociama par r<~K- ■ > ^,__„^__fl,j.orccceuce. arc 

free ElBr iudf, «nd kt»r » DNA (by 

DNA primers and ^ Wa If the UNA 
its intercaht^ b«c~«» fini dmSntion cycle, target 
double-helix). After the ^"^3^ After a ?C*. is 
DNA will be ^y.^^^^ iTt^mcr^cm 
completed, die most i^lf)ofup« 
the amount of daDNA J*^^ p a 
several micrograms- f om«riy ne * fj» • ^ flu0 re!- 
additional (tsDNA, resuHmfl L» * 0 unt of 
ence. Tfcere b also '^^^fo^rTo^NA 
«DNA primer, buc bcw*e : *« «Wg ° f ^ ^g,, oa 
i« .nuchas Chan to ^^;Srm fluof«- 
A* toed Huorcscencc aatc6as a£bn*ta 



RE5"U1T* ■ of EcBr. In order to chc 

J^J^ilc^wSrioS of the Wan HIA 
atfect of ElBr in ^^rJL^ ^ Ae dye proem a. 
DQn gene>° ^Jt^to £gU (a ty^iol eoneen- 
ooaccnnaito .6^ e f nucleic aid, foUovin, 

gcldectr^re^uCKS diffc^cc in th= yield 

eleCtr0 P h °Ir^.T™^c^n product whether EtBr ^ 

absent OT preicnt aL 
ina Chat €tPr docs co« inhl&it ri^K. 

i^rtn™ of human Y-chromojomc jspcaic 3C - 
Datectioa ot our _'" ( : fluoreacence aohanccmcot of 

^phficmont found on the human 

3-? fi<: W I2?£ ^c^T ioiti^y contained either 

cu^n: refleoa the rati that oy uae j, Na a 

.fluoreseco*. can be ^«^^ lU cyc la number, 
becoming ^ n ^^^X^U shout three-told 
As showti, flu ^^ 'i^nVc for thePClb contain- 
over the ^^^tlfdW no aeniReandy maeat. 
«g twitran male DNA, but dW not SB"^ ^ 00 

for negative ""^^^r™ niaU DMA 
DNA or hutnzn fcr^e iJAA. ^ n8 _th e fewer 
pr«ent to bepn ^fflK nuc 
cyel8« *'«re needed W grva » "~T" nroduccs of tK«« 
Seance. Gel Wj'Wj S A frlK^f «• 

SSSLn-Sc BADHA ryrttne^ tooK place tn * 

^racS^ increase *J«-^^S 
VyWy Hying ^^iC'^Kh • red 
crinaiUuminatorand P^^^L, ^ reart oio that 
filter. ThU » .hown In tore IBfcr d- 
bepn with anj m^ D^A ^ *oa^ p _ 

Detection of «p«e>ftc aueica 1 w 4P n,-oach h» 

kdc In order to detnotutwie r^^^ccuon 
& .P«^ » ^^rw-TplrS^Hsure 
of thTncWe^U anetBU ^^^"^d Amplification) 

* show, *^~3*K SS"-r. A®*; 

of the rcacnon ^rm^d uUne^mer, t pec^c for 
reactiom were P^ 0 ^^.^ of the htim» 
ther the ^d^pea.' nc^-ceu n iu ^ , 



1 



^y plactaf the ricWo^utato ^« " p %^?n^ 
nudeodde of one ^K^Ij osus a»- 
^in^-rure^uncr ^^.^ oj^. 

^nmer b COmp1cmec.ary.to : P-^ohtr g*» P ^ 0 f 
P ^.pairofamphficaoo^ «jhO^ Hg» ^ ^ ^ 
a reardon with cither *^ » p b , ^re. fhrc= 
tuba) or dcklc-dkU •P^.^g 1 ' ^ r^t««yg0»» 
different WAiW ^W*-, from n hetcroiyg^ 
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r -^tiow each). The DMA ryne vai reflected m the 
"ud« fluorescence incenndes In each p:ur of completed 
~m D Vflcuiona. There a aigruflcant increase In fiuora- 
^1* w>«re a fl-fflobm allele UNA matched the 
•SS^W. - 8 .pe^fluoionacter 

not ahowa), thi. fluoreaoeoct was aeout three time. 
^? 0 r=4< in » PCB. wher« both |i. 8 lobin adetes were 
^Kd » the primer C.t electrophoresis (not 
■S^ablkhed <U thu increase in fluorescence was 
S £= ryitthesb of Marly a microgram of a DNA 
ft£l»* of*. «pe«rd sia* tor ^Jte«wi 
ISSdiob of dabNA in reacrioat to which the alWe- 
Sr^rSer ww »l 6 inatch«d to both alleles: 
3 *t£&£* .-nimrias of a PCB. Using a fltor opdc 
j.«se it IS powible to direct excitation lUumwaaon &pm 
t PSrSfluSome« to * PC* undergoing 
"LIS 50 return i«s fluoreaeenee to the tpoetrofloorometer. 
n,« nuorescenee readout of oueh^ arru^emen^'- 
■Lud at an EiBr-ecntAininc ampticaoon of Y-cbromo- 
C«« tpecific sciences from 25 oc ^utrum ^e DN^ 
£,hown m FifUK 5. The readout from e control PCK. 
v&thno target DNA Is also shown. Thircr cycle* of PGR 
•5 l ^<fe monitored for each. , . ■■ 

4 The fluorescence trace as a funcooo of time deariy 
j h ows the effect ol the tnennocyenne. Fhioracc^e uuen- 
>Jw rise* and tails inversely With temperature. The fluo- 
wicence intemity is minimum at the dcn^ranon^K,- 
ibtreeure (WC) and maximum at the annealtoff'cawwioo 
Wratire (SO-C). In the negative-control FO»,.tW 
i&eenee maxima and minima do Mr change w«mS- 
ttndr over the thirty therwocydes. mdladng that there » 
iKrde <3jDNA synthesis without the appropriate tweet 
DHA, and there is little if any bleaehing of T.inr dunnr 
'the cimdnueus illumination of me sample. 
• "laic PGR containing male DNA, the ftuoresc^cc 
'jrinhna ac the annealin^te^i^ camperaruie begin ID 
likreasc ac abuuc 4000 acconds of 
'Sue to incrciuc ~ich ume. iodieauag that dsPNA * 
being producer! at a detectable lc-cl. Not. that the ri^ 
l Vwccace minimi ar the dcnawxuoon temperature do not 
/uan-Acmtly increase, presumably iicawcf dw t«-p«r; 
•«urc there is no dsHNA Tor EcBr to tad. Thwjh • 
■ of che amplificarion <s followed by mvcbhc the fl^«^ 
1 *«acc i*creo*« at the annealing temperature. Ar-aly^of 
product* of these two ampliations by gel clet^opno- 
:W showed a DN A fragment of the expected aire tor Uic 
ihale DNA containing sample and no detectable UNA 
tfKDCbcfils for the control tiro pie. 

^ Do^ua-cem proeenee. weh ^ hybr l rilnrirm w a sc- 
■qucncc^pceifie probe an enhanee the rpcaficity Of DNA 
^atccuon by PCIL Th« aliminaoon ol theie process^ 
i*c*m that the 3 peo£citT of this hoinogeneous assay 
[ttpcnk .Qiciv on that of PGR. to «e ^to"5^ 
tdbewe, we h»v« >hovD th*t PGR aloco haa suflBpent DNA 
Iwawc jpcriiiricy to permit ^eneoc xnepmf. Ustogf. 
^prapriaic atnpfificadon conditions there » liKlc noft- 
^ycifcc producdon of daDNA in th« ahsence of Oie 
•.approprlacc target allele. - 
5 The spedficiiy required to detect pathogens can , be. 
TRor^ or l«s tban that required to do Ijcncuc •cre.cinn 
:*Fcndlne on chc number of pathogeofl ia the samoJ* ana 
•^c amcunt of other DNA that must be taken th« 
triple A dOTcuk target is HIV, which re^r« 4«wctton 
i viral yencrae that can be 2X the levd of a few copies 
jj*t thcujandi of host cells c . Comp«cd ^ch £an«tic 
keening, whirh is peTfornird on cells containing »* l ? 3il 
?f*.»py of che tareci sequence, HIV detection requirea 
more sperinory and Che inpuc ot more total 
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tidenciiuaibeTSof °^«t^uen«, -uicresies 

che background ^?^^STta v dcte«cd- An 
auor«.:=«a product by WX muss ^ ^ 1o v 

oopy-numhcT 13 the t0 «^f° " _ J3ian 'of one pmner 
arufaa. Thb » the reeulc o< h tn U occurs 

uun<J rhe other P™^^^^ 0 * 

JEMS »pn J * ^XT'cfe of 
•10 Irxrea* TCR 'pcafia^ reduce ^ a n um- 

ber of approaches, induitm fiw u" « ^ d (b e 
^plifieSons that take P^'^^y reduced 
-hoi-cm", « «^ aw Trf twe^S« Seforc DNA 
by raising a« «paw^ apr 
rmthaai* begini". TreUnunary ^"g^^U reduced 

Sd it - PO^ » detect the SfflV ^ ln a 

background of lO 3 colli. WQ "^"jT, -ranoouc 1>NA 
bacXiround fluorescence OT^,^^^ ntay 
becomes problematic To reduce th* ^ffiL d ' 

the oligonucleotide P™""*; • ..^on „f flu<^«« 
Wc U ve ' h «^*''^Sb ^cbtfer-ard. 
ge^attdby an&B^con^^ ^ndpupusly during 
OOlh ifflKfCRa ^r3£«W* automat *f*p* 
thermocychiig. TKe Sat ' v. ^mntish ed b the <eo« 

promismg aspect of this ' ' ..u jaaBU-. 

of competed ^Tor^ djnuorea- 

mennnon m 90-wcll former-. »° ™» £ r u d; 

oaec in each PCK eaa ^X^hSS5^SV^«* 
even at '»^>n<SK-teij«w 
dia ocfc of PC*s to » 96-«v««> w e» F"™" 

A du^ct «K«.» t ion of Sn ught and flu- 

„uldpW RbaropoCS PCiu Tte ahSlity 

to monitor mulapleTCRs ^££^"^3 stKW s that 
dmdon of argot UNA copy ^"^g^ the sooner 
& urger the amount of «^»^f £££ Pieltai- 

n ^*p^^ 

concentration. , molecules U 

Coa> CT «ly. if tbo .^^J^S^oncinuo^ 
taoxn-HU it an be ui genetic 3 ^"^~^ ia}tt v0 ^ 

monitors f^Z^V&t^^^^ 
t.ve and fehc ncfrw-e r<*ul=. ^'^7^ detecable 

6dSre S rmee by a predictable ^^ter that 

Incrsajes io ftuorcaeenea ^."f^S^ r Stf '^8»*« 

cyde would Sf-^fSaSSS l»» P«^«" 
reSUlU due to. tor eaunpla.jnjuo" ^ 

fluora^ence Increase ^^"^J^^ decea a true 
a«a— many more th»n are nacossary w 



or absence of «t boaad on tbU 

be onportant. In »J ' b « 0 ^ 2«Sicnt of m fal* 

^^^^^tobcob^ 

S±T itSSS^PCR of Ares, -fcoee W . 
1q summary, the ui aw diDNA makes it 

rce«ce » e ^^?NA mpSlcwion front ou ai d s 
potnWe to detect ^^^^^0 upon tlu. 



BXPCWMENTALjJ^OTOCtM-^^ cosine 

PCnuwere«tup«">« u t' 4 Jr. Mv a. ; j.s oiu« of Tie dna. 

volymer-a £? I ' c, Sj^JJrr B ' :, '" e " 
of human HU\-DQo. g" 1 * *£^™Tl« It? eooi« of DQa I-Ct 
UKId shd OH27" -* tSSES rSian. sSudiutn fira.-d.-J 

R5'i352rftal ,, SS£a*d for 2« cyto - a .mod 



S3TS^ « fused ^rircyto ^ m^O, 
!l. lliermocyc^.P^t^ad for t ^ya cr>. w «B » "«•!>•: 

or no hun-n W^S^S^. TU oumher of cycles (or^ ; 
[ 0 ^C^» ufJScd 3- mdRSCtnee mca.urc-.l 

them u deterUKa bdow. . „ _. ee ..,u». ^ 



iem u deterUKJ below. bta K «. Amplifications of: 

<lc«cnb«rt tor H1A - U ^^ D J> eidi-J pnmar pair KG TO 

target DNaj. TWo* fCSaoeMslnee w ^ CTO gj V4S 

HtfHA fWd-typa S^^o^WchBrfmerper/a"-; 

hZan DMA dat «u h r ora ^^ /S^fW) or DNA 
uST^. hcterorypus f<« ' ^^"^^^.e — for J«J 

ptograa-iV 1 """•^tScEoSfi^ TaraptiBcaricn. Cotu P k»d| 



(sr£X Moon, ry/. r~jrr G C <S6 ^ cut-off W,3 

^wpdcDc^oiy i k placed in ^ 

5^S, and receive «ntac* ^ J; * Th« ^ 

(torn roctr jM* «4,*« ^ U ^ ^^pSaaon o! 
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IMMUND BIOLOGICAL LABORATORIES 



sCD-14 ELISA 

Trauma, Shock and Sepsis 




^-iha CO-U molocule le expressed on thR surtace d 
.monocytes and some macrophages. Membreno- 
Abound CO-14 is a receptor for lipopolyfieccrwi/idG 
■cJLPS) complexed to LPS-Bindlng-Proteln (LBP)v Tne 
." concantraiion of Its sc\vhte is aJtenad under 
••'certain patholog.cd condtiona. There ie evidence for; '' : >;=;'\; ; '7 -- . 
■ "an important rote of with poiycrsuma. snp^i 

♦ '•burnings ana inflammations. 

'/During septic conditions and acute infQctlons It seems. ... 
:-to be a prognose maiicer ana is therefore of value in -■■ 
I ^monitoring tnese padents. 

For more information can or far 



IBL offore an BJSA for quantitative determlnaticn of 
soluble CO-14 in numan serum, -piaama, oell-oulturo 
supernatants and other biological fluids. 
Asa&y feawrae: 12x6 determinations 

~\ (mlcmt'rtcr strips), 
" : ; v,/\5/ prccoated with a specific 

monoclonal antibody. 
• 2x1 hnvjr incubetion. 
standerd range: 2 • G6 
dctactlon limit i no/ml 
;: cV:intRr and ir.taraaaay< 8% 
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WISP genes are members of the connective tissue growch factor 
family that are up-regulated in Wn t- 1 -transf o rmed cells and 
aberrantly expressed in human colon tumors 

0»anf ^twnica't, Todd a. Swanson*, James W. welsh-, MaRGajlet a. Roy*, David A. Lawrlince*, 
James Jennifer brush*, u$ A a. iAMEYHlixfi, Betwanne DeueuJ, MjchaEl Lew*, Coun WaTaNabeH. 
Robert L, Cohen*. Mona P. M6lhem*\ Gene G. Finley* \ PuiLQyi&KEtt. Audrey D. OorjOAKfrt. 

KENNfiTti L HlLLAN 1 . AUSTIN L. GURMEY*. DaVITI BoT^TF.rNfctt, ano Arnduj J. Lsvtne* 
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Contributed by Ouvid Bofstcux und Arnold /. Ltyiic, October 21. 199$ 

ABSTRACT Wot family members arc critical to many 
<kvuh»pfncntd| proccjjcj, and components of theWal sicnal- 
iar pathway bare been linked to tumor igencsb in familial And 
sporadic colon carcinomas. Here we report the identification 
of two zeacs, WISP-} and WTSP-2, that ore up-rcrjuUicd in the 
mouse mammary cpldieilal cell Une C57MG transformed fiy 
Wnl-l, but not by WaM. ToettDcr wlfn a ifilrd related £cne, 
these proteins flermc 0 SuhftunHy of the connective 
USiue grwrn raoor family. Two distinct systems demon- 
Stratetf wrsr Induction to he associated with the expression or 
w Q f.l. These Included (t) CS7MG cells infected with 0 WnU 
retroviral vector or expressing Wnt-1 under the control of a 
tetracytui* repressible promoter, and (it) Wnt-1 transgenic 
mica. The WmP-1 genu w«a localized to human chromosome 
8q240-8q24 X WJSP-l genomic DNA was amplified is colon 
cancer celt linos and in human colon rumors and ita RNA 
over«cprcstod (2- to >30-fold) in tA% of the rumors axarotned 
compared >nth pa tic at- matched normal mucosa. WISP- 3 
mapped to chromosome 6q32~6q23 and also was ore re* - 
pressed (4- to >40-fold) in 63% of the coloa tumors analyzed. 
In contrast, W/SP-2 cropped to human chromosome 20q12- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced f2- to >30-foldJ la 19% of me tumors. Tnese results 
suggest that the WISP genes may be downs (ream of WnM 
slenailneand that aberrant levels or wrsr express ion in colon 
cancer may p'ay a role *n colon r.umorigencsis. 



Wnt-1 is a member of an expand in c family of cysteine-rich. 
glycosylated eignaline. protein! that modiato civerso develop- 
mental processes such as the central of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fetes (1 4 
2). Wnt-1 oricinally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinornas (3. 4). Although Wni-i is not 
expressed in ihc normal mammary gland, expression of Wm-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, wnr tamlly members initiate signaling 
by binctfne to the seven-fransmembranc spanning Fii22led 
receptors and recrulung the cyioplasmic protein Dlshevellea 
(Dsh] to Ihc ceH memorane (1, 2, 6}. Dsn then inhibits tha 
kinase activity of the normally constjrudvdy active glycogen 
synthase k(nace>3jy (OSY*3p) rorulting in an increaco in 
p-catenln leveu. Stabilized £ -eaten in inter act* with the tran- 
scription tacior TCF/Lef 1, forming a complex thai appaars in 



The puoilation costs of this an'de «cic udTayexi (n pan t>y pig« rtarge 
peyniwt This arddc mult fhcrctorc oe net coy morcca ~sjv!/tuem£.if In 

dccordoncc -Ith 18 VS.C |1 solely to Indicate this fc« 

CS 1903 by Tnc Nauoc J Ac^Itmyof So»o«* 00^3424 /03yaSt47l7^n.00/O 

WAS is aviiuoie onnac if iwww.oiiu.on. 



thc^ nucleus and binds TCF/Lcfl target DNA elements to 
activate transcription (7, 6). Other experiments suggest that 
the adenomatous polyposis coli* (A PC) lumor suppressor gene 
aleo plays an important role ia Wnt signaling by rcguUtui^ 
p-catenin levab (9). APC is phosphoryhucd by binda 
to 0-catonin, end mcilitttcj ici depredntioti. Kutatiuns in 
either APC Or p-oatonin have been associated with colon 
eaxcinomaa and melanomas, suggesting these mutations con- 
tribute to the development of these type:, of a a peer, implicauAg 
the Wnt pathway in tumori/;cncsia (I). 

Although much hafl been learned about the Wnt si^naline 
pathway over the peat aoveraJ years, only a few of the tran- 
ccripuonally accivuted dWnetream cotnponaita acuvatec bv 
Wnt have been characteriz?d. Thoce that have been described 
cannot aecount for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target eenee oro 
tnoce encoding tha nodal-related 3 gono, XAr3 t a member of 
the transforming growth tactcr (TC?t>|i sup erf amity, and the 
bomeoo ox genes, engrailed, goasecaui.n^m (xrwn), and siamois 
(2). A receni report also ulenfttcs c-mj*c as a target gene of the 
Wnt signaling pathway (10). 

To Iflemtfy aCdilional downstream genes In the wnt signal- 
ing pathway rhat 2re relevant to the transformed cell pheno- 
type. we used a PCR-hascd cX»Na subtraction strategy, sup- 
pression Subtractlvc hybridization (55H) (U), using kna 
Lviiaied fmm C17MO mouse mainmajy epithelial cells and 
C57MO cells stably transformed by a Wm-1 retrovirus. Over- 
expression of Wut-1 in liiia cell line is sufficient ui induce Z 

parcialiy transformed phenocypc. charaacrtzed by elonjratcd 
and rcfractilc cells that lojc contact inhibition and form a 
mululcycrcd array (12, 13). Wc rc;tsoncd that gene* differen- 
tially expressed between these two cell lines might contribute 
to cho transformed phenorypc. 

in this, paper, wc describe the doninc and characterization 
of two genes up-rcgulatcd in Wnt-1 Iran sibrmcd cells, WlSP*l 
and WTSP-2, and a third related pene, W2SP-3. The WISP genca 
are mcmbcra of the CCN tamuy of crowih footors, which 
includes connocuVc tissue growth factor (CTCF), Cyr6l, end 
nov, a family not previously linked to Wni signaling. 

MATERIALS AND METHODS 

SSH. SSH wa< performed bv using the PCR-Salec: cDNA 
Subtraction fCit (CIjONTECH). T^ter doublo-slranocd 

AnifevlatlQfYS: TCF. Uin^orsung growth h»cUin CTCF, connective 

ci»«uc e^u^ili factor; SSI1 suppression subtractlvc nyonointio^ 
VWC. voo WilUbrond factor typo C module. 

Data deposition: Tho Mqu«oc«< reported m ihU pspor h«v« b<wn 
dcpositsO in the r.enban^c darabasc (iccess'oa nos. AH0CT/T7. 
AT10077a AF100779, Af 100780 «jid AHOOTbt) 
To ^*hom rapnnt roc^uesU should be addrcassd. e-mail: diano@geac. 
Com ^ 
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cDNA was synthesized from 2 w of poWA)- RNA isolated 
from the cy/MG/Wnt-1 CcU line and driver cDNA from 2 jxg 
of poly(A)* RNA from the parent C37MO cells. The sub- 
tracted cDNA library was subcloned InlOapGEM-Tvcctotfor 
further analysis. 

cDNA Library Scraening. Clones tncodln* full-length 
mouse H75y-J w CC c isolated by screening a AHtlO mouse 
embryo cDNA library (CLONTECH) with a 7iM>p probe from 
the original partial done S6S saquence correspondta* to amino 
acids 12S-169. Clones oncoding full-length human WISP-} 
were isolated by screening AgtlO lung and fcUl kidney cDNA 
ttomrics with the some probe at low stringency. Clones en- 
. coding rull-lcnglh mouse and human WISPJ were isolated by 
screening a C57MG /Wnt-1 oc human fatal long cDNA library 
with a probe corfespondine to nucleotides 1463-1S12. FulU 
icngth CONAs encoding WISPS were elonad from human 
bone marrow and fetal kidney libraries. 

Expression oi Human *W RNA. PCR amputation of 
firSUitrand cUNA was performed with human Multiple Tiasue 
cDNA panels (CLUNltCK) and 300 uM of each dNTP at 
9VC for 1 sec, 62*C for 3U sec, 72*C for 1 min, for 22-32 cycles. 
WISP and glyceraldahycU-3-pnosphate dehydrogenase pumcr 
soquencaa arc available on request. 

in $ti* Hybrtdiwtilan. ^P-lahclcd sense and antisense nbo- 
probea were transcribed from an S97-bp ?CK product corre- 
sponding to nucleotides 601-1^0 at mouse WOT-/ or a 
294-bp PCR product corresponding to nuciantldes 67.-571 of 
mouse WtSP-2. All tissues were processed « described (40). 

Radiation Hybrid Maopinc. Genomic DMA from each 
h/Ofld In the Stanford G3 and Genebridge* Radiation Hybnd 
Panels (Research Qcnclics, Huntsvillc, AL) and human and 
hamster control DNAS were fCR-amplified, and the results 
were submitted to tne Stanford or Maasexhueetta Institute of 
Technology web servers. 

Call Unas, Tumors, and Mucosa Specimens. Tissue speci- 
mens were cctained from the Department of Paiholosy (Uni- 
versity of Pittsburgh) lOr pahcnCS uadCTEOing colon resection 
and from the University or Leeds, United Kingdom. Genomic 
DMA was isolated ((Jiagen) from me pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human call lines! bWdgO, COLO 320OM. HT-29, 
WiDr, and SW403 (colon adenocarcinomas). 5W62Q (lymph 
node moiastasia, colon adenocarcinoma J. HOT U6 (colon 
carcinoma), SK-CO-L (colon adenocarcinoma, ascites). 2nd 
HM7 (a vuiani of ATCC colon adenocarcinoma cell line LS 
174T). DNA coacentrotion ^as determined bv using HoechSI 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by uomogenizttion m?M GuSCN followed by centfirugation 
Over CsQ cushions or prepared by using RNA20I. 

t iene Amplification and RNA Expression Analysis, Relative 
gene amplification and RNA expression of WtSJ* and c-nyc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR Gcno-specific primers and 
fluorogenic prOCes (sequences available on request) ware 
designed and used to amplify and quantitatc the eencs. The 
relative gene copy numoer was derived by using the formula 
2u«j where ACt represents the (inference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphooyle DNA compared with colon tumor DNA or colon 
tumor RNA compared with norm2i mucosal RNA. The 

0- method was ueed for calculation ot the SE of «he ^enc copy 
number or RNA expression level. The vmrMpecitiC Signal was 
normalized to that of iha glycenldehyde.-3-phospnatc deny- 
dreaenasc housekeeping gene. All TaqMan assay rca£e0ts 
wcie obtained from Pcrkin-Eimcr Applied Bioeystems. 

RESULTS 

Isolation or H75/M and WJSP-2 by SSH. To identify Wnt- 

1- lnducihle genes, wc used the technique of SSH using the 
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mouse mammary epithelial cell line C57MO aodCS7MG cells 
that stably express wnM (11). Candidate differentially cx- 
prcssod cDNAa (1,3 total) were sequenced. TTuny-ninrt 
percent of the sequeneos matched known genes or homo- 
logues. 32% matched Mpreased sot;uenoc tags, and 29% had 
no dutch. To confirm that the transcript was differentially 
expressed, semiquantitative reverse trnnacription-PCR and 
Northern analysis were performed by using ujRNA from the 
C57MG and C57MG/Wnt-t cells. 

Two of the oDNAs, mSP-J and WISP-*, weec differentially 
cxprcasod, being induced in the orMCr/Wnt-t ceJ« line, hut 
not in the parent C57M.G celU cr C57MG celk ovttexptesslng 
Wm-^V (Fig. I A and J9). Wnt-4, unlike Wm-1, does not induce 
the morphologiceJ wansfonnotion of C57MG cells and has no 
effect on P-catcnm levels (13, 14). Exprcwion of WISP-) woe 
up-reguiated approximately 3-fold in the C57MG/Wm>l «U 
line and WISP-2 by approximately .1-fold by both Northern 
enalysis and reverse transcriptioru^CP . 

An indopondent, but Cimiiar, system was used to examine 
WISP expression after Wnt-1 induction. CS7MC cells cxpress- 
-Ine the Wnt-1 ecne under the ccnWOl of a retracyclinc- 
reprcsslblc promoter produce low^ eniounta of Wnt-1 in the 
repressed State but Show a strong induction of Wnt-1 mWlA 
and protein wiinla 24 hr after rc'racyclinc removal (fi). The 
levols of Wm>l and m'SP RNA Isolated from Ihcsc cclU at 
vanoue times after tetracycline rcn«rvat were assessed by 
quantuetivo PCR. Strong inducrion cf Wnt-1 mRNA was Seen 
as early us 10 hr after tetracycline removal, induction Ot WKP 
mRNA (2- to 6-fold) was seen at 4fi and 72 hr (data not shown). 
These data Support our previous observation* that show that 
WISP induction IscorreJaTCd with Wot-1 expression. Because 
tho induction is slow, occurring after approximately 48 hr, the 
induction of WISP* may be an intUrcXl response to Wot-1 
signaling. , , 

cDNA clones of human WISP-l >*'ere isoUted and the 
sequence compared with mouse WISP- /.The cDNA sequence:: 
ot mouse and human mSP-l were 1,766 end 2,830 bp ia length, 
respectively, and encode proteins of 367 an, with predicted 
relative molecular masses of -40.000 (M r AO K). Both have 
hydrophobic N-terminal Signal scqucnLes, 38 conserved oye- 
teioe reaiduec, and four potential N-Unkcd e'ycosylttion sites 
and ore 84^ idonu'cal (Fig. 14). 

Full-length cDNA clones of mouse and human WJSf-2 were 
X .734 and 1,293 bp in lcn«ch, respectively, and encode proteun 
cf 251 anf 7.10 aa. respectively, wirh ptxxUcted rolativd molec- 
ular masses of ~?7.0rtn tJW, 27 K) (Fie 2iJ). Mouse and human 
W1SP-2 ere >3% identical. Human W1SP-2 has no potential 
N-Unked glycosylation sites, and inOUSC W13P-2 has one r.t 










KM 



Pic. 1. < sod WISP-l arc inducud br Wot-l, but net Wnt- 4, 

expression In C37MG will. Korthcni anal/lie ot WIS?*} {A) and 
WISP-2 (B) cwewion in C57MC, CSTMG/Woui. snd C.^Mt./ 
cclb. Poly(A)* RNa (1 w« mhjecied to Morfhern 0'ot 
wioly^* und hybodlxea wish a "«5««p raoube H^f^speoAu yiobc 
(unLo «idt 373 -3UU) or a l^-hp WAT'O-spcdlH.- jjtooc (auclcotul« 
U3E-1627) in the v untranslated tcglon. Bluu were rdr/briduwd w.ih 
human ri-c=tin probe. 
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F,c 1. Encoded amino acta JW""*"* twE^S* 
Irenun OTSM (.4) slid mouse aid human IW-SP-J («)- ^ Pfi"'!* 

VWC Ihronbwundio (TSP), and C-iermmsl (CD OomsiOS are 
underlined. 

poeition 197. ITOM list 2S cysteine residues that aire eon- 
served amoni the 33 eyttcines found In WISf-1. 

Identification ot WISP-*. To search tor related pnteuu. we 
SC^ned expressed ..<»e»- , S (EST) ^ 
WtSP-1 protein sequence and identified sevenJ «rs» 
potently related sequences. W. Identified a homologous 
£outo tL we bV called WISP-3. A fuUJenjth ttmnan 
W1SP-3 cONA of ). Op was Hoisted corresponding to mote 
EST* that enco« a J5*-aa protein with a prodded 
muJ of 39,293- WISP-3 has two potential N-ltokcd 
jUion sites and 36 eyrteine reridues. An aUcnmcm of 0»e thrae 
human WISP p~«eio, show. *at Wisr-t «4 W«Ww *! 
Snular(42% identity), ^STa^^l?^ 
»ith WJSM. and 32% identity *>th ^ (J* .„, 
HWPx AT* Horaalosou, to the CTCF Family of P rtttUS. 
Human tTOf-/. and WW fxe novel 

however, mouse W.t A-/ is UK same » the recently 
£tr.J sens. Hmi is expressed In low, but not high, metastatic 
mousa malanoma cells, and suppresses the in «»o growth , «d 
metastatic potential of K-Y73S mOUSC meUnomc colto (15). 
Human and mouae WtSP-2 a« homologous to the reecody 
described rat gene, rCop-I (16). Significant homoloa -06- 
*4«) wassccnto the CCN family of 8 rowto ttCtOIS. 
includes three members, CTGF. Cyr6l. and 0* F££°«0- 
gene nov. CTCF Is a d.cmotacuc and autogenic ««rto 
fibrobUtu thst Is implicated in wound heeling ^**°<* 
disorders snO Is inducer! oy TOF-fl (17). Ctt61 «'«".^f*'" 
tulor matrix signaling molecule that ptcmotes call adhesion, 
proliferation, migration, ang.oecnesfs. and tumor 
19) nov (nephroblastoma overexposed) is. an immediate 
early rvne associated «tlh quiescence aflC I found altered .n 
WiUna tumors (20). Tito proteins of the CCN A****" 
functional, but not sequence, similarity <o Wnt-t AU «« 
secreted, cystcinc-rich heparin b.nding glycoproteins (hat as 
coaate with the cell surface and exloca luUr.malT.* 

WISP proteins exhibit the modular orclutcerurc ot the CXN 

hmily, rturaaertied by tour conserved ty^*?****** 
(Fig. 15) (21). 'IheN-tc-rminal domain, -hich mclud^t lha fin,t 
12 cysteine retiduea. contains a consensus sequence 
CXXC) conscrvad in mort insutin-U>CC rrowth factor (KM 1 )- 
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Fid ' (X) Encoded amino acid «q<wf». alignment of human 

nmnt inwSf-3 are indicated wid. » dou (./r) SeWBMle »wwf^ 

«iSu*, fv.rti«l llootVThe tour eystf me rovluo m <KoWC domain 
b£! i« WtJf-H are indicsud »id. a do, (C^f 
WT£P mRNA in Human msues. PCR performed oa hum»« 
eXtaWe £&pLds (CLOKTE.CH) fmm me Wicsteo aou't 
xmt fttal tissue. 

binding proteins (BP). Thi* cequenca Is conserved H WI9M 
and WTSP-3. Screes WISP-l has a glutamlne in the third 
Position Instead of « CTCF reeenoy has W>A» 

to speeKleally bind 1CF (22) and a truncated 
laclciha the IGF-BP domain ia oncogarac (X3> The von wn 
lebrahd factor type C module (VWC) also /ound » certs, n 
colUgem and mucins, covets the net 10 eyeta.ne residow, srd 
i, iboughC to participate in protein complex formation aua 
olSarization (2d, " Ttm VWC doouin of WISP-3 djffers 
from all CCN family members described P«cv«:usly in , that .t 
contains only six of lha 10 cysteine residues (Fl£- 3 £ and *). 
A short varilble ra C ion follow, tha VWC On**™ 
module, me thromboaooodin (TSP) tomau. « ^edta 
bincl.ne W Solfatcd ElyccTeooiugaloe and contains W cysteine 
rcsmuf. anO a conserved WSxCSxxCG mouf fi« >o>ntmed in 
rhromoospondin ('3). The C-lerminal (CT) module eootsiu- 
ing the remaining U cysteine, il thought to ba mvolved m 
dimariaauon and'recepcor b.ndlnc (26). The CT dom«n . 
pteosnt in all CCN tamity members described to d«ta but «■ 
absent m WISP -2 (Fig. 3 >{ and B). TOC cx.r.renee of a putative 
Signal sequence and the atrtence of .» transmembrane donuun 
surest that WISP, are secreted proteins, an 

ma P mnialian ceU and baoulovirats cultures (data ««i 

Expression of *1SP raRWA in Human rucuaa . U««- 
speeifie exptession of Human MSPs *as charaetaruad by PCS 
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analysis on aduli and reiai multiple tissue cDNA panda. 

expression vas seen in the adult heart, kidney. lung, 
pancreas, placenta, cvary, small intestine, and spleen (Fig. 3Q« 
Utclc or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral Olood leukocytes, prostate, tesdsVor 
thymus. WISP-2 had a more restricted tissue expression. £nd 
was detected in «duH skeletal muscle, colon, cvary, and fctal 
lung. Predominant expression of WfSP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WlSP-3 
expression were detected in placanla, ovary, proctato, and 
small intestine. v « v » 

In Situ Localisation of WISP-I end Wt$P*2. Expreaaion of • 
WlSP-1 and WISP -2 w^ assessed by in sine hvbnditation in 
mammary tumors from Wnt-1 transgenic mice, Slrcna expres- 
sion 0/ WlST-l was observed in stromal fibroblasts bi"C within 
the fibcovascular rumor stroma (Fig. *A-D). However, low- 
level WISP-I expression also was observed focally within tumor 
cell* (data noi shown). No expression was observed in normal 
breast, Lite WtSP-l, WSP-2 expression also was seen In the 
tumor etroma in breast tumors from Wnl-1 transgenic animals 
(Fig. 4 £-H). However, WISP-2 expression in the stroma was 
in spindl e-annped cetta adjacent (O capillary vessels, whereas 




PlO. 4. (A. C. £, snd O) Representative hcfnitocrtWcoain-staiacd 
images from breast tumot* in Wni-1 craiuGcnic mice The correspond- 
ing dtrk-ficld imago* shoeing WISP-t axpr*6dion V© shown in S and 
D. Th* lumor ia a modonualy weU-ditferantialcd adenocoraam> 
showing 4videnc« of ^danotd cystic change. At low power -fyi; Utl H). 
expression of WfSP-J is seen in the delicate oranchini fibruyasciiUr; 
rumor stroma (arrowhead). A' hi£her magn location, exorcasian if seen 
In the strornal(>) fibroblasts (C aitd O). md turner cells turc; n©c«^o. 
Focal exorenion of W1SP-1, Kowcw, *<aa obacrvod in tumo/ coJUun 
lomc areas. Images of Wi$P~2 expression era shown in £^ r .^jJow 
power (fi and F), oKpc*«icn of W1SP.2 is teen in cells tying wiOunftid 
fihrovaicuUr tumor stroma. At U'gncr m3£niQcaUoh^tncsc , cefl5 
appeared to be adjacent to capillary vessels whereas rumor cells arc 
nepcivc (G and if). ' .'- 
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the predominant ceil typo expreasirg WlSP-l wae the stromal 
fibrObleJtS, 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human HOT genci wu determined 
by radiation hybrid mapping panclc, W(SP-1 is approximately 

\m cR from. the meiotlc marker AFM259xc5 (logarithm of 
odds (tod) score ifv*< jon chromosome 8q24.l to6q24.3. In the 
same region as toe Human locus of the novH family member 
(27) and roughly a Mbs distal to c-*t>c Preliminary fine 
mapping indicates that WlSP-l is located near DBSl Ul STS. 
WJSP.2 is linked to the market SHGC-3.1922 (lod ■ 1,000) on 
cnWnaosome'2b<}I2J0qU.l. Human WISPS mapped tc chro- 
mosome 60.22-6^23 and ia linked to the marker AFM21lae5 
Qod - 1,000). WISPS ie appraximatery 13 Mbs prxximal to 
CTGF end 23 Mbfl proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Abcrraa t Exp neseloo of WISPs in Human 
Colon Tumors. Amplificetion of nrotoc ncogenes ia seen in 
many human tumors and has ctloloeicid and prognostic a«V 
nffJcahce. for example, in a variety of tumor types, c^mvc 
ampiincatiofl has heen associa'crl with jnalignent progrcaskm 
arid poor prognosis (30). Because 1*7. V-/ resides in the same 
general' chromosomal location (Aq>4) as c-myc, we asked 
whether it was a target 01 gene amplification, and, W SO. 
whether this amplification was independent of the c*nyc locus. 
Genomic DMA from human colon cancer cell lines was 
assessed by quanciutiva PCR and Southern Wot analysis. (Mg. 
5 A and B). Both methods detected similar degrees of WJSP-} 
amplification. Moet oell linos showed significant (2- to a-fald) 
ampliScttion, with the HT-29 and WiDr cell lines demons trat- 
ine an : 8-fold increase. Significantly. lh« pattern of amplifica' 
tiun observed cid net correlate wish that obeotved for c-myc t 
indicating that the e-rrrvc £cne is not put of the amplioon that 
involves die locua. 

We next examined whether the WlhP cencs were amplified 
In a panel of Vi primary human colon itdcnocarcinomas. The 
relative WISP gene copy numher In each coinn tumor DMA 
was compared with pooled normal DNA from 10 donors hy 
quanttiaUve *»CK (fig. 6). The copy numoer of WJSM and 
WISP-2 was significantly greater man one. approximately 
2-fold for WJSP-1 in about 60% of the tumors en£» A- to Moid 
for WlSP-2 in 979o of the tumors (P <: U-UU1 lor each), t he 
copy number for WlSP-3 wis iod is tin gviish able irom one (f = 
0.166). fn addition, th« copy number of WISP-'J was signifi- 
candv hifiher than chat of WfSP-1 (P < D.001). 

The le>elt> of W tranacripU in fcNA kolatod from 19 
adenocarcinomas and their matched normal mucosa were 
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Fio. 5. Amplifieoiicn of WlSP-t «nomic DNA in colon c»4»cex cell 
lines (A) Amplification m cell line DM A aaj determined hy Quanti« 
tatiwe PCR. (fl) Southsm biott cootam<n«j gnomic D^A ('0 uZ) 
digested wjca/^RI orxnol (c-myc) ^crc hybriditcd 

3 lOO-bp humiia TO/-/ urjbc (imi;iO ac«dj 166 -a°) or a human 
c«m|C. probe (located at by 1903-2000). Th« WISP and mjc caoda ar< 
dotoctod an normal human g«no*nic DMA of tar a long*/ film oxpocurc. 
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, « * r*-« • v = = iii - 2 2 £ 3 fl ? ? P ' 

Tumor Number 

F,c. (v Ocaomic amplified of Wttf tt£g**g& 

fll - rt „ The relative 2 cue cony oumbor of tt.e w«r BW>«;»«.*- / 

OKA f torn Primary hanuo ^ Nhl pooled f 1 »£2 
donora. Th. d»* »« <«ins ± .stM from OTU. "P 5 ™?"* 0< " w 
^..o. Th. eapeximent wis repealed »< least *~ , ; ;r 

^cd By quantitative PCR (F«C 7). Th. level of MSN 
^2 present in minor tissue «cfed 
bawwd I J- to >25-foU» in B4» (16/19) of Ao humaneoloo 
eLinod compared with normal «H«cent ou.co.a- 
E^o£19wmo««how«d greater than I0-(oldover^r^c«. 
in contrast, in 79% (15/19) of Ac wok eaamtn* «W 
KNA expression wassignineandy lower the tumor than tnu 
meeo £ Similar 10 »M>-I. H«M UNA -a ovocexprc^dm 
(3% (12/1W) or the colon Timor* compareo with the normal 
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Flc . 7. tWJF RNA exoresslon in primary Human ^ ^T" 
«UI-ve u, expression in ..cym.1 muooae f ^ m * C ^ai^ W 

S3 '. £££ t«. dac, ,« ca ein£ .-: spx j-.^sssr-' 

dona in triplicate, me experiment wit icyc««cd « t"»< «. •«! 



Pwc. Afad. Acad. So. CJiA 9J f.'W) 1*724 . 
mueoa.TlieamoontofoverexprcssionarH'/j/'.Jraneodfron, 
4- to >dO-fold. 

DISCUSSION 
One .pcroaeh to understanding lh{ mtilccula basis of cancer 
" »■ Sly differ.ncar in gene expression tenrca I «ncer 
and normel cells. Strata. b«ed on assumptions Aat 
^dy7ta« mRHA levels w5l differ oetween normal and 

^ ^ idSti «onct lata** expressed in 

of *e 8«es plated, WKM. WlSP-2 ^ «WW, 
3 re memocrs of >Jlc CCN family of crowd, fcexorv -ta* 
include. efGF. Cyrfl. *nd nov. a fawdv not ptcoucly Unked 

"SJSSSSL experimenta! aynctna demonotr^ that 
»75F induction was artociaied wi'ft the cxoresiOQ of wnt-1. 
rE£* CS7MC cells infected with a Wnt-1 t*irov.r.J 
^«o" or CS7MG eelU expressinj Wnt-1 under the control of 
7 e J.^r,pr«rible promoter, and the «^ «> « 
Wn«-J U mice, where breast Ml ^g"^ 
whereas nocmal breast tissue dcat not. No WISP Kj<Aexprca 
ston^^s detected in mammary rumo.t l«^C£0 by polyoma 
^«^ddte T .ndeon (data not thown). Ihcse data surest 
SKX ^ it. that in the* two vmtimtt. 
& faS? was cotralated -i«h Wn<-1 ocpre^-oo 
. STdHI whether the MS* ere d, J-ft* "JSS 

couTdrc^U from wnt-1 sitnallm diroedr 

transcripuon fictor reeulation or altemaurcly through wm a 

SS.«n*l on a aanscrlptlon facto, wluch « tun, 

f ^jwJ8Sfr. »- addUlona. SUWamilyof the CCN family 
of crew* laetoB. One vtr.kiog th.rcrcnce observed « the 
L^!«iu«nee ot WlSP-2 « the .H-ssnce Ot a CT demur. 

Snot motif exit aa doners (32). It .v 
A'ISP-1 and WISP-3 may exist a* dm.*", w^etaas wi>r 
exLis SS i nonSier. If the CT domain * atoo important rt.r 
exiSU ss ^monotn« re ceplor through a 

receptor binding, wiar-z may oina i» r «JLa- 
diUetent dglon of the moleeole th.« the other 0»6g 
memtart. No speclflc receptors have been ■toUoadfarCTCF 
o; nov. A recent report has Shown lh..t .«te R r.n «A s«vea i» 
in idbeiiori receptor for Cyrdl (33). . ,. 

Th* atrong a^reaclon of and « 

widaih tho rrbrov«eu)ar tumor Stmn.a „ brceat tamon ^om 
Wnt 1 wansaeiiic animals is eonsbient -.th prov.oua obatr- 
Zti £ Tanrexipts far the related. CTGF gene we, p* 
miruy cxprasiad in the tibrous Siroma of mammary rumera 
SSJmSc* «e though, to conctol the prol.ferat.on o 
«nn«?Wc ti«ue stroma in mammu.y tumora by a eaacede of 
S factor aie.naU timilar to that coiumllms conne«.vc 

^"h ffctors iafaiimolew the prcductton ot ClOf and 

Tw^fnStbat WW-/ and 

observed in the Siromal cell, that J ,>r.o U nded the tumor aUr 
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(epithelial ceils) In the Wm*l transgenic mouse CBcdont of 
br«act titfue. This finding 5Uggcab ihti paracono signaling 
could occur in which the stromal cells could supply WIS P-l and 
WIS p. 2 co rccuUtft tumor ce»' growth on the WISP cxtraccl- 
luler matrix. Stromal cdlUdcmsa factors in the c«rtccliular 
matrix have boon postulated to play a fO«c in 
mierauon and proliferation (35). The localization of W5£/ 
and »7SM in d»e strvmal cells oT breast tumors supports this 
ptrttcrinc model 

An analYfiifl of W/SM gone amplification and expression^ 
human colon ronton ihowcd a correlation between, pNA 
amblitlctdon and ovcrcxpraasion, whereas wercxpressfqn Ol 
W1SM RMA waa seen in the absence of DMA amplication. 
In contR«, DNAwu amplified in the colon, tumors, 

but iu mRNA expression waa significantly reduced ui the 
majonty of rumors compared witli the expression in normal 
colonic mucosa from me same patient. Tno eene for human 
WtSP-2 wsl; localized to enromnsnrac 20ql 2-20ql3, at a regie in 
frequently amplified an<? associated with poor prognosis in 
nod« negative br*a« cancer and many colon canwra* .^test- 
ing the exigence of one or more oncoeenea at thta locua 
(36-38). fiecauce chc cen^f of the 7flql3 amplican has no^yol 
boon identified, it is possible that the apparent amplification 
observed for WISP-2 may ne caused Oy another icpcaiKthis 
amclicoo. x • ' . 

A recent manuscript on rCop-!. the r2C onhplogue of 
WlSP-2, describee the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in coll lines (16). Although the mecnanism oy wnicn 
RNA expression ia down -regulated d«nn 0 maUgpant 
transformation is unknown, the reduced expression of WISP-I 
in colon rumors and cell lines sweats that it may function as 
a tumor suppressor. Tticsc rcsuib show that the WISP genes 
are aberrantly cxprcssecl In Cdlon cancer and surest that their 
altered expression may confer selective growth advantage tc 
the tumor. . . 

Memben of the Wnt signaling pathway h^c been tmplH 
catcd in the pathogenesis of colon cancer, brcasr cencor. ana 
melanoma, including the tumor Sflpprcssai jene odchomatoua 
polyposis coli and B-catanin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation, ol 
cytoplasmic p^cacenin* which presumably comrtbttcs to- hu- 
man carcinogenesis through the activation of Utf CCt genes such 
as the WISP*. Although the mechanism by which Wnt-1 
transforms cells and induce tumorigencsis ic unknown, the 
Idenrificaiion of WISPs as cenes that may be regulated down- 
Stream of Wnt-1 in C57MC calls suggests they court Oe 
important mediator* of Wnt-1 iraneformauon. The imd^ca- 
oon and altered expression pattern* of the WlSPt in humac 
COion tumors may indicate an important rolo for these genes 
in tumor development. 
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Baker tor technical assignee. T, Dcmd for raouwm hybrui mapping. 
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Real Time Quantitative PGR 

Christian A, Heid, 1 Junko Steven*, 2 Kenneth J, Livak, ? and 
P, Mickey Williams 1 ' 3 

^GioAnalyticai locnnoiogy oepniuufcnv, Cenerttcch, inc., South San frWicisco, Calitornle 9408QJ . 
^Applied *oSy*ems Oivianm o'f r&kln Clni* C^i^v City, California; °*ifW 



We have developed a novel ~real «m«" quamlutUe PCR mciliod. The method mea«ur<x PCR l pradain 
arrumuUfion ihraueh 4 dual-labeled fUioioeenie probe (Le., T^Man Frol*). ThU mwhort provldoc wy 
accurate ami reproducible quantitation or g«nc copies. Unlike omer quamiiailvt PCR methods, real-time IXK 
docJ not require post-PCR sample handling prov^mluB potential VOi product carry-ow examination and 
reiulrlnf (n much faster and hfelw diro.ujjhpur"jis»ys. The rwkliwe FCR mcAod has a very law dynamic 
ranrre of jcanlng «rw morula deUrn^natlon (it liriu live ordars of magnitude), toUlnw auantiiarlvc 
TCR Is extremely accurate and less UfeorMntertirlve than current riuanriracive KR methods. 



Quantitative rniCNuc acid sequence atfaiymnas 
had an important wife iii numy fields of biologi- 
cal research. Measurement of genu* expression 
(RNA) has been used extensively in nxmUoiJpg. 
blolofjloil responses in various stimuli fiViii c{ iii.' 

HuaiiR el aL 199$b,Ij; Wud'bomme el al 
1995), QuaniiUUvr j/cnr analysis (ONA) ha* 
In-on uivd to <M Ermine the gUfHlll tf# ^ujullly of U 

particular gene, iii the cue oi t he hummi fftR2 
gene, wind i Is amplified In -30% c»f breast tu- 
mors (Macnoii el al. 1587). Gene ami gcaqtne 
quantitation (pNA and nNA) also have been tiWoV 
fur analysis of human inununodcfldehCy virus 
(UlV) buTden demonstrating enanges U\ the lev- 
els of vlnu thTOughou l the Otf fared phase* of the 
disease (Connor r.r i] m 1993; JfliHak ct 3 1, }WMr. 
Furradn oi a(. lyvs). 

Many methods have ken described for tVu: 
quantitative analysis ot nucleic acid *e:|uenecs 
(both for UNA ond DNA; Southern iv/^; Sharp a 
al. 19H»: Thomas i9xn). Recently, pci na* 
proven rc be n powerful tool for quantitative 
nucleic acid analysis. k;Ic and reverse rramcrlp- 
we (K'O-PGR iwvc permitted the anetyali of 
minimal sraning quamUies of nudcic aiid (as - 
litUe Hi &ne c^M v^uivalcnt). This hes made pbs- 
alblc many experiments lhat could uoi hnyc hecii 
pwffj/oiud wl\h tradiUonal methods. Although 
PCR lias provided d powerful tool, it is iinpcrarivv 



' thai U Dc UbeU pit>pcj.ly ror quajnUu\lon (Wa*iy- 
mackeT* 199S), Miny early sports of cjuanliU- 
Ovi- I'CK and RT-PC* described quantitation of 
. ihe ^CR product but did nnt measure th« Inlllal 
tkr^H .-sequci ice quAutlly. Il U cssenliaJ to design 
proper control for the quantitation of the initial 
1/ugvT >cqucnccs (,Mcrrc 19^2: Clumcnll et al. 

lis-^archcn hry*. devclopcxl several raetbods 
m quantitaiiv* I'CK and UT-HCR. One approad^ 
meaaureji PCk product <ju»nnty In xhr plvis* 
/if lite reiKHon before tne plateau (Kdtoec, et al. 
lOPQ; Vang «< «K itHia). This method r«?uircs 
ihat each sample to equal input amounts of 
: nucleic acid and that each samj'la undo' analysis 
amplifies mtJi kknlk al efficiency up u> the point 
hf quanta* tivo analyjJc A gene sccrucnco (rnn- 
taliivO iff all aomplc* ot relatively constant quun- 
t;t5«t, such a* M<?lin) can bo uwd for JwrniHe. 
umilliricalion c/lidency normalization. Ufiln« 
ttinvenrtona] in<itnods of vein detection and 
quaintiufliuri Ojd elecrrophorciis or plal« capture 
hyrtrfdl7JltUin}. It Is ejdremely larmrlous to assure 
that all 5«inpk'3 urt nnalyml during tliu log phase 
a/ flic reaction (furlMJih tbc t<i^cl gene and the 
honnmlizafifin gane). Another rnci hod, quanuia- 
tlve competitive <QC)"N'R, ha» been developed 
and U uted vyidciy for Pc:R qu'jmrution. Q<M*CR 
n;lic$ 6:i the inclusion of fln Internal c<mlrol 
couipcihor in each reaction (Rnrkrij-Andrc 1991 ; 
H»tbk ct al l^Ja ( l>). Th« cadency of «?ch ro- 
acuori is nomullxrd to die Internal competitor, 
a biidwn JWfniun of ini«-mol compoUlor can be 
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addod in ouch mimplu. To obUMl rj??«lV(' 
tmlon, the ujiUnctwn larsct PGR pf oduci " com- 
pared with the known cnsnpclilar l^fR product ; 
Success v( a qua m Unlive competitive lO asjay 

relies on developing an internal control that am- 
jOirutv wUh the sante efficiency tliv nu^ut «*>oi. 
oculo. The deMfcn of the c*J7ijv«-"Mo>t nndlhv. va)I- 
naiton or amplification efficiencies jcqufrc a 
dedicated effort. Hv«mv«^ bc<au«c QC-J'C:U 
liuticqiiiR' Uiai PCZRjfuiUucls bcjivalyxcU during 
che ioe ph**e of Uiu *iupUf*cal(bn; U f* D<m incite 
<.#f ilic iwti methods ta use. 

Severn I dtfrucuun *y*lcnu «ic u*tnJ Toi qiein 
tlifttlvv- VOt U'lM'CU a.naly*i<» (1) ii^^ac 

ycls, (Z)/iaurao:iii IbUcliffX nf K *H pfooucu and 
detection vWih lit.<u:*-Ciithicyv(I fluorescence vuUig 
Capillary «lc<rtrM|ilu>jx/ii(» (F«t*co *« al. IPOS- WfU 

flama ci 1996) or acryltuuldc gvkt,- iiiid.O) plate 
capture, and sandwich probe hybrid 1/4 turn (MOl- 
dor «ri cd, 1994}. Although t)u»v «iylhod» prwywl 
successful, eacn inemod requires posl-VCR ma- 
nipulations That add. tin iu to ihu analysh ;md 
may lead u» JubuiaUM y < mil <u I'd nation. The 
sample throughput of Dk'k mrUunU 1* limited 

Mn* rKCjcpuon of the plate cnjnuic ap- 
proach)- and, lhun;frarc. ihcui- met had* ore not 
y«%\\ suited fm do ni*ndiu£ ht£h sample 

thrvu^jjnat (i.e., acTcccihii; of Jar^c numbers of 

I jIlKiuil«A.ul<* in .i(i*.lyxlii£{ JtAjnpIv* ft/r dia£;tOh< 
dc* or clinical trial*}. 

Here ixiporr mc ocveinptiirut nr u novei 
iLVJay fnr cjvoruh Jii va DNA anaryniji, Tb* «wwy is 
l»f«3<pd on t)«c w*c of llic ,V nuclcaae a.uay first 
described by Holbmtl ct Jtl. (1991 J, TJio jiivOjoO 

ih<.. ituolcAtc AtHMiy of V'm; p^lyincraM: i« 
dwvc a noncxicndlhlc liyhrfdi/Hiion prohc tUtr- 

lu^ t*»c «xt«u:«iot« | of 1'C1<- Tin; oji|in/odi 

mo cKioMuuclctl flxjoro^ciHr iiylir^ dilution 
pfalacs (Lcc- c( 3iJ. Ji?P3; DauW «\ ah \9t>$; l.lvo!i 
«•! «K. J Onv flwun7<vnl dy^* wiwo as a 

rcqpewter |PAM (i.e., ti<ar1x'xynuarvacctii)f and 
emission specie* it quenched by Ihc «rce»nd flu<- 
r«Me«iti dyr, (Kr,^ Ci-caHwxy-tclr,imcihyl- 

rlicxlamfnc). Tlic nuclease dcgraidition of the hy- 
hrUHKution \m>hc rcleaiua the Mucuehlfig nf ilie 
I'AM fluorescent. eiiiiir«kiii f re»u)lirtx Jn an In- 
irctoe In peak fi-uorc^cenl cralaaJou <ii nin, 
Tlic USc Of a MOuciKC dctcctnr (AUI I'aun) nllt»w« 
moaiuremsm of fluor«5euuf npiieir^ ui'nil f6 
tif roe. i Hernial cycler conamuuiMy ouhhk ihc 

l*C"«R ampljficitlf.vi./l^hcn-.rim:. \\w niueUuii« «jv 
niofiUorvd ut cvnl lime. TJlC Oiiijna doiit 13 dc- 

scnoea an<» quantitaiwc mudyab or input ui|«ci 
UNA «c^ue-.\ccs u di?c\i>sed lielw. 



RESULTS 



fCR f roducx Dorealon In Rtwl Time 



'Hie jiool wiiK to develop a hSglv-ihroufhput, >eii» 
xU;~«, piwl »iHiirale gene qoyinHaiion HSXuy for 
v?c lo moidUtHrte »pid modialed tlwapovilic 
gtrnc dc«vc;ry. A pi an n Id \incodmc hu^en foctor 
V1U jyetio co<pj«rtce f pI^STM (s^g Mctluidt). was 
uacd Oi a ine>d<il ihcTr&pcuUr K«n<u 'Jlto assny upt<i 
fJu4*f^^cni Tacjiuan mo.ihodalo^y »nd an lnslro- 
' raerjt eaippblc of oieHftwrin^ nuoroncwKc i»i «c»^ 
thrift (Aid Prism 7700 Sequence Hpierior). iiio 
Tai)i«H«o rv^eilojt require n JtyhrldUmtloii jirnlw 
tabled Willi two different fluoruiwem dyes. One 
dyu \% a ro^xirl vr cly<j (l«A*d), the other s quench- 
iiiM Jye (TAMftA). ^heii Uie p^Jki U InlncL (U»o- 
4 e scent energy Irait^fer occ utt «nd the reporter 
dye l/uorcjuumi ei7iix«ion is absorbed hy the* 
qiicndilns $y<t (TAWRA). During Die Qntonahui 
pha«e of the pck cycle. Ihfc fluorcstx'iil bybrid- 
(xjillnn f«rol>c U cleaved by llnr 5'-^' nucleelyiic 
ociivity of the DNA polymerase. On cleavage of 
Ihc probe, the report cvr dye cmisiion it n« lungor 
iraiwfcrrcd efnetcn^y to Hie gucnchtoK djc. to 

ivOtil'H l»i O" 5ii<.r»a*o of tho ruporlor tlyu Huofci- 
ccnt oinaaluiA *jwirn. PCJR $^r;««c^.- |»r«jb»n« 

vrvrv designed ft»» ilie human fucior vill 
Cjnence And huniui p-acfln jjun« dt-ji«:rth*d In 
Methods). OpUinl£Al|on refletlcui> were per- 
rdfmod U* ctioaie trie flpproprluto prubi* ami 
mAgncihun eoneenuolions yielding 'dfilu-j< 
imciviliy of leport.cr fluorescent signal wiihotit 
Huerlflchi^ speeJfieily. Tho Inaifumoni u«<:f a 
clMiKir-coupJcd device (l.c„ CCD entacrd) for 
measuring the fluorwent cuvU^oii «pvclm f/om 
A(><i m C.56 nni. Uach r<:il tul>e w« 3 jnoiUiortsl 
M-ijire»»ti'4lTy fnr 2S> m.*«c wi<l\ cv.nUnuatt* munJ- 
t<»nnA thrvnf^h^ntl «l<« •iM|ilific»ti«n. li*e)i Iwbv 
wa.i re-cxandricd every <l.S >ee. Cc«nipulcr s<>ff- 

Whrc. wn< do^i^ncd 1« <iKa«riti»« flvorcsccnt In- 
ter! Pi ty of boih the rttjMirccr dyt: (FAM).i\nd 
tbc quenching dye (TAMIlA), 'i'he H«cift*ccnt 
intensity of \h<s qu^nchln^ dytt/l'AWJUj dldliAVS 
very fiHle nver «he course of \\w PCk amnHfK 
cation (datn uoi iliuvrn). TW«forfl f the Intensity 
of tavtua dye <?irtissk>fl serves «s an Internal 

.xlaitdard whh which in •iomumIUm lh<j r<j>or|<ir 

y\ r * (FAW) cmlaalon voruinuns. Il^e aofiwarc cn3- 
dilutes <s value tvrmcd AHn (or AKO> uiinj; the 
following equation- Afta - (On 4 ) (Rii"), whe-o 
Hn J - enibsloil Intensity ui Kixarter/einbsiirtj h\- 
leusity oi* cjuenchor nt ony gwen tUl^c In ft ie«c 
tlon tube, and Jhi - vmiMion iMencitily of 7e- 
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poncr/emtosloii ImensJiy of quencher mcuMircd 
prior to 1 J CH 5iin|*Ulicalicni in m»i <ame inaction 
tulic. Yur the purpose of quantitation. UK Um 
three data points (aUnt) rrMiccled during the e** 
tension step for each I J C:H <.ycii- *^rc annlyrxd. 
Tiieniicieoiyticocfiraflartoii orine iiyuiiiifluitoA 
probe occittt during the exte«*xun phase Of nut, 
and, therefore, reporter fluurcsccni cnaiaj on. h*: 
crease* during thfc lime. "Hie <Jit> potirtM 
wre ftverojed for each K^K cyvlc and the uic*u 
value for each w«*i* plotted in an "anipHiieatian 
plot" shown In I'ijjurc 7 A. The Atin mc«ti valu*- i.s 
punted on the v.jixjs, ana nine, represented ijy 
cycic iHtmbtr, iv plot f cut tm (livx-exlt. During ttt« 
early o'dca of (he PCU amplification, thr AUo 



value remalnt at base line When jaif Helen l* hy- 
' binkfjvuilioh probe hits boon cleaved by Uiv Til/| 
polymerase nucleate artlvtry. the Intensity of re- 
port** flut*re*<s;/il enaUuilon lnwviu\%. IA<*\ PCH 
uiiipJlflv* Una* nruch a plateau phun* of reporter 
fluuiearcii'i scriiMfon if me rcmitiun hcarfifrl fun 
tohi^H cycle imoilw. The om^Kf-ailon plot h 
. vxaiuiiKui vutiy in ilm reaction, nr a pulnt thai 
::v,c vii|iufgikiitF Uw fog phaw of prodtid arnunula- 
tioo. This Is douu by MSAifcnln£ an anMbtirv 
ihtahoJd n««i U tuned on the variability of the 
btro-iiac uV*- ln-'KUuns 1 A, liie IhnshftM wa* net 
at ia .ctanrtam Uuvinfions above <hc mrau nf 
Viw lint aflibviun i^lcuLar^d frnni iiydu* 1 lO 3 5. 
Once the threshold ij chosen, the point at which 
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wpure 1 PC* product detection m real t'me.(A) Tnc mooch ttqo *jfiware wtti construct amplification ploti 
trom xh« exion«;on phau fiuorofic^nt emJiiion data conectad during the I'CR amplltouoi%. The standard de- 
vidian is determined Irom the data poinu <oflccl<;d from the bwc line of the amplication ^ ^ lwfet 2 ™ 
colculatcd by determining the poinl rt which the fluorescence exceeds * threshold UmU (usually 10 times; the 
standard deviation of the base line). <«) Overlay or amplification p!a\s of serially (1:2) diluted human genomic 
DMA **n.glos amplififtd with fi^actm pcimoct. (O Innul DNA concentration of th* samptes plotted varsui C r . All 
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the amplification plot erocseo the tlirc:ihoi<J'ifc-ae 
floed as C,.. C,. is reported a* (ho eyelo number :tt 
fid* poiui. ak win be demons rut <ui, the e: f value 
1st jiicJiLlive or the quantity of lnj»v»t t«r^i4. 

Input Tft rgcr Sequence 1 

Plgu/xs IB xhovs amplification yioU of 16«drtTc*- 
tiH PGR amplifications overlaid. The ampiifUM- 
tforu wore performed on a 1i2 serial dl1\itk>n -ufl 
human genomic PNA. *l*hc amplified tar^ci w,*u 
hum/in p nctjn, Tho amplification plot* nhifl to 
th* t'tQhi (to higher threshold cycles) n* the input 
Urge! quantity ic reduced. Thtt Ji Avpcctad ha 
i'jnutt NWCtlmiE with fauitw dtArtins cop it vi of the 
target molecule require coaler ampJIflraUoii to 
degrade enough probe to mrim the threshold 
fluorescence. An arbitrary threshold of JO ,iton- 
dard deviations above Ihr ba.\p line wuk used to 
determine the C r value*. Kriuv IC repi went* the 
C r value* plotted versus the 'ianiplc dlJutluu 
*-oiuc, fcnch dilution was amplified in trfpJlcalc 
W'X amplfflcnhYmc and plotted as meun value* 
with c/ioc bius representing one standard (tavla* 
turn. The Or yhIucs OCXTcasrhiioariy with Jncjvas- 
in? target quantity. Thus, c;, vaIui* can be used 
as (f qvmntirai i~c measurement Of the input target 
number. Tr should be noted that tlu< ampiJficD- 
nan plot for the 15.6- jig sample .ihowrj In Figure 
IB does noi reflect the same fluorescent rate of 
increase exhibited hy most of (he other samples. 
The 15.6-nr; sample nlsu achieve* end not m pia. 
tcvui of n l«.iwpr fluorescent value than would be 
expected liaml on the Input £>NA. JhJa phenom- 
enon Um been observed occasionally wi.Ui other 
ttmplci (daia not sliown) and may be attribute 
able to lata cycle inhibition; thi> hypothesis lv 
Will under (rivcati^fition. U Is important to nore 
that Hut flattened <lrm« and early plateau do nor 
impact friejnlfJcaiitJy the caJculaiKl.C, valuo ac 
dcmoa^nratt?d by the 111 on the line shown jn 
figun* 1 C. All triplicate amplifications rvstihert \n 
*cry jjitnilai Cr values— the standard deviation 
Old nor exceed 0.5 for any dUuOen/TMj experi- 
m«nt contmna « >100 # 000-fo1d range of Jnpui t^r- 
gr:l nifilrrulex Using C, values lor <|iiaiilJ Ullo^i 
permits a much forger assay xonge 1 1 wn direct Jfy 
Uflin? ioirI fiuorcscatt emission intensity for 
(juamliation, Tlw Ihwar range oi Jluorcsccnt In- 
Itnaity measurement of ilic A1W l'ri«n 77on Sc- 



KIAI 1IMI OUANinAUVI VCH 

nuMirc over n very l:u^ range /if r^lallvr* ciarnn^ 
Utrt;H rjiwnti«M5. 

Sample Propararion Vai;<Aatlwi 

Several paraineieri influence the <«fn<Henry nf 
PC;R urnplifieiition: mo^ocaluirt nod >ult coneen- 
triitioiii. rcaalon coa^dftlons; fj-^. t"»ie iam. 
j.«orutuio), PCH (argot sine compos it Ion. 
prinuT sequences, and sample, purtry. AlJ Of The 
.atiova lactors uru common tn a <inpk J*< waay, 
excrjit sample to «inplr purity, in an e^ort to 
validate the method of tampU prcpawuon for 
tltelacior vjji nsjuy, J^CKorapiihcotkm ixprodur^ 
nilUry and oiricJcncy jo tepjicaic sample 
P>v|Mrat!oTK wcuv t!Kamin*d..Aft<:r fccoomle OKA 
wjLt iireparcd fnjjii the to rcpllLUte samples, tnc 
UNA was quantUatcd by ult/avjolct speetroscopy. 
AmpllficQ lions were performed anaryzlnito-aribi 
XCJic corttcnr in 100 auid 2S ngof total kwkmjhc 
UNA, lutch K:K nmplJflcmion was performed in 
UlpUcdU'. C^ampirlNon uf C r values for \^cli trip- 
licate v^mple chuvv trtiolmul variutiou baaed an 
standard deviation and coefficient of variance 
(Vaoio 1). Therefore, each ol the tripiicuo PCll 
ainpllflcutiona w«j hl^ry fcprodUdhlr« # demon. 
s t rating thai re^l time FCR using rh1c inj(r<rrrion- 
tntlon introduces minimal v H riuilnn Into thr 
quajitlu<iv^ j'Cjt wi^irpls. (tinnparlsun (if the. 
mean voiuc^ of m* 10 rty]icote asuplc pr^u- 
rations nla<j ahowed minimal variability, jnotcai- 
incr that c;jch Sample praparataon y>vl<lv<l snnilflr 
rctulxs for u-nclln gene quftfltity. The h; ? hr*1 C r 
difrerence hetwem any nf rbe samples was o.«5 
••mil 0.73 tor tho 100 and 3S ng stimpk% r<i;p«t> 
itv«jy. Addirionally. the ampJIflcation c «f each 
fttmplf. exhilslnal an ct|tilvalcnt rale rjf iluortv. 
ccjtt emission iolcnMty chan^ie per amouni of 
1)NA turret onnly-r.cd 03 indicated by similar 
Mopes derived from (he i,imp)c dmJUoru: (i-^g. i). 
Anyramplc contaioinsyfln excess of a I'CK inhibi- 
tor would exhiltlt a £rt»ate/ measurect (5-aciln <: r 
value for a glv«n quantity of DNA. In addition, 
the Jnliihitor would be diluted alnn? witli thu 
>ample in rhe dllunon artnlysas (Hg. 2), altering 
the expected vilue chaatge, Kach sample am- 
plification yielded n similar rauh In xtxst analysis, 
demonstrating that this method nf sample ytr^. 
ration is liiftiily re^-c^ducihle wfth regard to 
jMnpic purity. 

Quantitative Analvsisof a Pla*mid After 
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TaMv 1. Reproducibility of Sumplo PreparAtldn Mathod 



a 

9 
10 
Mo* 



100 ng 



25 ng 



Samplo 

no. C T 



* tan deed 
mftfcn deviation CV 



18.24 
18.23 
7 8.33 

ia.33 

18.35 
IfUU 

ia.3 

13.3 

18.42 

18.1S 

ia.23 

16.12 

1H.38 

18.54 
18.67 
ip 

18.28 

18.36 

ia_S? 

18.45 

18,7 

18,7? 

18,16 

1834 

18.36 

18.42 

18.57 



1«.27 0.06 

18.1> 0.06 

13.34 0.07 

ie.23 o.oe 

10.42 0.O4 

18.74 0.24 

16.39 0-12 

1B,63 0.16 

18.29 0.1 



0.37 
0,36 

0.23 
1.26 

0.H3 
0„15 



i*.*S 0.12 0,66 
ia,t2 o.i r . o.9o 



20.48 

70.55 

20.5 

20.61 

20.59 

20,41 . 

20.54 

20,6 

20,49 

20.48 

20.44 

20,3 e 

20,68 

20.87 

20,63 

21 .OP 

2104 

21.04 

20.67 

20.73 

20.65 

20,98 

20.84 

20.75 

20,46 

20.54 

20.48 

20.79 

2078 

20.£2 



mofln deviation CV 



20.51 0.03 

P0..H 0,11 

20.54 0.06 

20.41 COS 

70.71 0.13 

21.06 0.03 

20.66 0.04 

20.66 0.U 

20.51 n.o^ 

20.73 0.1 

20.46 0.1P 



0.17 
0..5-I 

0.28 

0.26 
0.61 

o.v< 

0.57 

0.32 

0.-16 
0.94 



tor ixjjuamiOB a partial cUNA /or hua*an factor 
vtn, pi-irru. a acrivj of uantfcctJons w«is set 
\ip uslne a decreeing rniount of UacpUsinid^40, 
4 - 0.5, and o.i ^jg, l\vv-ni>-rour noun poM- 
IroHjfsrtlon, lolnl t>NA wc3 purified (Vtm% enctv 
flask u£ ceila. p-Aelin £wx quautUjr WMi ciioacn 
a <-»k«r for norma Ifcainm of xciiumk: nNA con- 
rnirrauoii lumeadi yaiuulc. ]<i ilds expttiintiHi, 
P-acrin Rvtic comeni should remain enmtam 

rdfillvc hi tOMl UUIUIJIIC UNA. Hjjurv £ xtiow* the 
wult or ihe pectin UNA incwurcmcm (100 iig 
ioioi PNA determined by ultraviolet spectros- 
copy) of mdi t-«u#i*1«. Ka<J\ d&rYiple analyzed 
in triplicate a no me mean j>acun Cy vaiuci of 
the triplicates were plotted (error bats represent 



bvh-*tan *ny sampTa moam WM 0.1*5 C„ 'Jell 
nanograms of total UNA uf Yiclj sa»n ( ->1a woj*4 alro 
examined Am p*4ciln. The results Qgiilu showed 
Umv very similar amount* <if (genomic UNA were, 
prwvfitj *f Jte maximum mean p actio O, v*1 u c 
difference wo.1 1 .0. As J'lfcure 3 ahow.^ ihc rale of 
p-actlu C r uliuriKu l~.-lw«n tho 100 and 10-o£ 
snjnplcA vvaj zimWui (slojjc voJacx ran^j« Wwqcm 

dnJ --3.45). This v«rlR«5 eg«in IhM 
method of i.ftfnplo preparation yicUb of 
idcmlca! ?Ck intcgriry G.<u. no sample contained 
. in CXCC$tlvv amount uf a K:U Inhiuilorj. ITow- 
ever, Ihc^c i<:xa)ts indlcnla l.hat chcIi samplccon 
Ulncd slljjM diffv-ieiicec in th« nctuAl amount of 
^WKunlc llNA aua)y*cd. Uctcnnltiarion of actual 
Kuituijiiv 1>NA voncvsMratiOA v/ot ncc^miptichod 
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10 








19 




i.a 1.4 mi iwo ij- i> * t t,i 

tog (ng infiui oimomic ONA) 

Figure 2 S<< i ipf« preparation puriiy. 1 he replicaht 
»«»pl« ihown fn Table 1 wore also amplified In 
tripicsie using 21 iig of each ONA ifl/rtplo. The 
uie xho*s the input ANA conceniritiian (TOO flfid 
25 np) v;. C, M ih*. f^ura 1h* ton and P«1 1 fig 
poloU for aath (ample are connected by a fine- 



by plollirujj the mcon (*-uctin O-j v a | uc obtained 

f\}T rSlfch UKUiy *HUlp]c till J f—IClltt ^3Ftl*Ttt 
cuiv« (shown in I'lfl. Th</ oaUinl ^ciiumlc 

ONA coiiccialratlim of each Sinnplc, «. wn5 Ob 
taiucd by extrapolation to the i.axit. 

l%vir<i 4 A chow** tlv* -inc*flura<] imwV 
normalised) ^uauUtica \>t JaUur VJ1I plo.rniiil 
(utctmj fmu* each' ufthc four transient cell 
lr*»^f*e4lf>na. Koch rencfltan cnnlaiocd J 00 njrof 
toial sample una (a j dcteriiioiivi ny uv epvetro* 
copy). lUch i«mplv ^un muoJ/iscO in tri pi 



78 
25 



*1 



/«r2L2a«<3.M« f*- i 



-•■-1,0 



SIS 



.5T 




M I -ft 
I03 (ng Inpojl DMA) 

figure 9 Analytic of U/jnif«tcd ceil DNA quantity 

and purity. Hie DNA preparations of. the four 29$ 
C«JI transactions (40, A t OS, and O.Vfig of pFSTM) 
were flnaly7iicl foi rhc 0-axtln gene, 100 <jnd ]0 ng 
(determined by t/icravioiei spectroscopy) lof each 
sample ware jmpllfiod in triplicate, For aach 
ornount of pf 6TM Hut was tramfected, the f3-nciln 
Q values arp plnaH vprxu* rhe tola! Input DNA 



Vr:i< *™pHG cation*. An shown, p|'8']%f purified 
ffarir.fhc 293 cells decrease* (mean C, value* In- 
CHni*£j with ducrca<in# amouuli ]>la«ttnid 
itfuioftilcd. The mean C t values obtained for 
pKbTM «lf» Tl^ure 4 A u/cre platted ujj a id-jrxl 
uurve OOmprl.ied uf vitally diluted pFKTM, 
^hown .in -PJpurt 1 Hie quarHMy ui pi Ki m, u, 
found in *(vch of the four lranfiToctlcms viw de- 
tcrmined by cxrrapnlatian to the x u%h of tJio 
ftiondaxd uurva in l ] i^urv 4J1, 'Hutfio uncorrected 
values, t>, for pFHTM vjert* ]iuruiMll/^rl in rtelcr^ 
mine the a<*<u>l amount of pJ'8'l*X4 found TOO 
/tK <if genomic ONA l»y iliIiir ibc cqutUon:. 



1* x 100 ha 



UCUUl plWIV Wjfiiti ner 

jc)C <#f genomic UNA 



whd/o a actua) (juiionic HNA hi u win pic and 
». jjrjriTW copies /torn the atdudcuO curve- 'Hie 
autm^Jixccl ^uunUty of pl'Ift'M per 100 of /cw- 
nnmlc ONA for e«eii of The Insna/cctioivi Is 
sno>vn m H^tia' *nie%c roulls xhrw ih*t ihc 
quanifiy nf facioT viii piysuud tisvoOatcU vriiii 
the ZtZ> ttlla, 2*1 lu- nficf *r»ny(Vx*iUj<^ iku.i casi^ 
wiui uccrvHaUji: pJHMiili) UJiiufnuacJcju uiod .*u 
ihc tnis>f«cfipn; 'ni*: quantity of pJ'H'J'W ftsaccJ- 
utco: rfUh Z93 cdia, afivr trun.ifcciloji >vhJi ^0 ug 
Of pu^nud, vas 35 pg per J 00 P$ ^unmnlc DNA. 
Tltis rejoin in -520 3>Jtuuiid copies per XvJJ, 



DfSCilSSION 

Wc li/ivc dc5cxibct! a new method Iot quantum- 
j*^ pc4ic copy numbg« *t*ing r^aMlmc «nujy^. 
of i'CK axnp^ficatlnnx. Real-time K:K ?.1 compat^ 
mic with Other of me two KJ< (kt-tcr) ap- 

pA^aditfx (1> <iu«nUUti>v cmnM*^ 1 ^ **l*crc a« 

uiieiJiul cumpctlOT for eacf] rargcr Ncqucnoc b 
used for nomajiJtxaUon (da la not *howu) or (2) 
qiiauttiflUYC comparadvc I'ClK uaJlig ti uoJiuaU^a- 
rtou geue coninincd within ihc sample (i.e.,, p-ac- 
tin} or a "hou*Kke*piJi£" «ene for RT-KIK. 'f 
eyual amountt of nucleic oad arc aa<iJy/ctl f</r 
eacri taruplo autl if the auipltfloKfon vf/uimcy 
bfcrtirt qaantltntlve ailctlyala ia tdcnu'cnl for tndi 
■cinipic, ine inreruaj curnuii (iUMmAi«y-aluau jjwje 
or compctit(Jr) fthcnild Rtvc e^utol x^wh for alJ 
5HinpIcv 

Tlio rcai-tim«? fCK method offers sev^nil ;id- 
vanttiRci over t!ic oihcr two mcthotb toirrcnvly 
employed (see ibe imrudurUan). Mrtt, the real- 
ninc PCK fnethod la ;>crf< »rmcd in doscnMube 
iytwm iuid requires ut> pcKt-IH.U niarilpulauon 
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H.flur* 14 au»ntlt*lK/o flna/yM of pFSTM in transitu* ex:!/*. <>U Amount of 
piaimfd OisiA u*,<| for ihg twnifrcilon plotted OfiAinst li.u h.umi C vufuc deter- 
7Su j P a , ^ hr after Ucnjfoctton. (a,Q Stands o,rvca of 

p*.flTM and p^<Ur% r«pccdvoly. |,rgTM ONA <*) and jjcnomlc 1>N/N (Q we* 
diluted My 1 ;5 before ampWIttiton wkK tK« cippropcW p r(me rs. The H-actin 



*ppf*>p««te primers. 
, n * — 'w cf A lo 1 00 f ijj of e, 

lO) The amount ol pfSTM present j** ioo n fl erf nc nomfc oma. 



i , . - ^ rr ,w^,, wsc piimcrs. inc H-ac(«n 

•bwdard curv* wan Uiod to n^rn^lu^ iK c ,„ U | U cf Alo 100 f|JJ ofcJ enomK: ONA. 



r*f HAm,tf v . Therefore, fit.. |w,umii*1 for PCH con- 
Iflmlnaiitm in \hc Jul*.r*k.f> reduced because 
AAipllIlud proJucu can hvan^yroO and disposed 
of %^ibum opening xiw t«u<*w | U bc ? . Second, 
ihis nurfW *«p|>o«in die u.mt iif a imnnuli^Uon 
«ono (U„ fj-eciinj /or auan«i4«liv« PCR e-rnooae- 
keeping gen** for <jti*n(iintWc RT-l'Ck control 
Analyrts h performed In real tune during iho Jo A 
phase of product accumulation. AnatysU during 
Jom phw pennib many different ^enc* (ever * 
rrl<ic input carftct ran^r) to (> c analysed .ifmulla/ 
ncuusiy. wlthoiti concern of reaching renclion 
ybtomt «t difleicnl cycles, Itua will uiak* mulll- 
$cne «n*iv>;* a*j« y , much cojtiei tv; develop, be- 
come individual interna] uinipciUuia will nui be 
needed for eneb gene under analyola. Tbird, 
>4inplc th'c«idipu« will Uuicd>c UitiinttU^ny 
-wiih the ucw nietnod bochune tlu-ie h no punt. 
J'CU proccnrinn ihne. AddHiooally. wrukl<iQ in a 
:)0-w<j« foniibC Is hi^lily ciM«|nitiWe v/dh auto- 
tuition leschnolojjy, 

TMe redKiuu; 1>CU uxjlbcM 5s li^hly rcprr- 
dueibk. Acp]k»ic ampilflcuxions can be eiiHly^ed 



far W h Simple nUij|mMn a jHMcrji lat ent ir. Ilni. 
*y<unn <Vt a very ]oj-£c 4,NSay dynamic 

nwige (a|i|iToac1itn$ l.OOO.OOO-^ul inning Ui- 
Ji<Jl). Udtng MamlqrJ eurvc £br\hc target 0/ Ifi- 
ten**!, relative carry nuniinrr vaiuw can be deter- 
mined /or any uitluw^i ^mple, Wu«r«cent 
lhr«lt<i|<l yqJue* t O r , eo/ieJair linearly *vim rela. 
live i>NA copy numbers K«iJ time qua mlta live 
Kl^HJl mclhodolr- ft y «iiu^n a al u this l«u^ 
h<M «lw been developed. ^Irwlly,. real Uimc C|u^i»- 
tUarlvr |-Citin«tli<KJoIo5y c«il be u>cd tudcvcJup 
high-*hr«»u£hpm ^eicenln^ €i3.iayA for o variety of 
duplications fquaiitJlnllvc gcn<» cw^wmuu (K'J*- 
rCR) ( n <in« o> 7 >y «a. aya (Hcrfc, JJIV, etc). XCnfl- 
inotjse £nu]ysi4) 4 and Jniciumr- 

nenj. 

lUiiMiuic P<^K inly also lie performed tijii»|j 
Jntcrail«tii\fi ttyc* (Uigur*; c.\ ui- pvieh to 

oUvldium bromide. The fluoro^enic prohe 
method o^ers a ma|or :idwant4ftQ over irner- 
<ulalin£ dy<a— -^rcatex *yccJfJcrty (I.e., pniner 
dimera inid nonsperifle PCM prnduct« am not de- 
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METHODS 

Ganoraijon of 4 PtafmW Com*] nine a Vorrtal 
cDNA far Human Factor Ylil 

YtlUil HNA hurwjtetf (KNA"-* U (nun 'M'lVetr lue„ 
)-f>VM<Uwood, TK> frwtu «.-vJ1> l.^o/Vclcd a iactur V1U 
«H«|ifc»Uiii ^lo*. pCtiLttvftJU (Kaum.ul «l. ltftto; dor- 
«iun <i at a ftieiur v|ll partial clnsM wrapMf kv WAS 

ff»*T»»uo uy irr i*» k:<h««aq\> ja rnu uw* ITU Vu 

(p«rt N<KlfMir/«f. i*£ Applied HU)5y£Can\ WUvi Oiy, <.ZAJ/ 
«>lnf, ilic H<:u pnuiefs J***f<»r MUfrv (pi?m<«r frauciirr* 
ut+ shown b^Juvv), "Hiv j in pi Icon wax scampi If l*cl uMnc 
modifrU Hifoi and Wrcv primers Uipjitiulvd win Httuttu 
Ui*rx\\\} rol/ltfJnn jrfrc: sequence* l» Uw VJ'Hf and 
doiuxf l« ii« /Kim- j2 (Trwp«*« Cinrp.. M*duou, VI). n,c 
rcfluk Intone, |iKlH w« u.vtl lor iraiuiv*" iraiufccilon 



Amplification of Target DMA ami Dueciion of 
AmpJicon Factor VUI Flacmid UNA 

(pVHTM) tr»J mnfrilftcU Willi till? |«lnma WWof 5'«<XX>- 
<JV< l< U^/i<JAVl*JtH a J i Crt*0,V and l=*riV .1 >a AA< x?r« 
<UAOCTOCAT(«l/rAon^.11icnmlltuj javduml.tt 
Up la'.K product. *J^c fnnviird juiim-r woa vlwdxuvcl w Kv 
ngnlrc a iwilqyv MH|m*itu< <tni»d In tkc 5 1 untrortllfltcd 
rqpoii «f iiiu tnuviii pCiS2.*<.eSi* nlo>n.J\| <md iWcforc 

tXiMti |«mI iwi ft )iUu «<rd Amplify Ultf l!iftlI0ff AlCllir VMI 
Stfti* I'mnw »vvr* chunau with Wv awtouir fit Ihf- com- 
puter piogrtm QWfi* <MI (Niilmrml IllMSCkurvclnC I4y. 
mmitk, UN). Tb* human p-a«l« f mr.^ umpllflcd vrlih 
ilwr prime* p-«v<;.» /wt«iinl prfmcf-S'-'PCACOCACAi;!* JT 
CC^CAlXTi'ACicrAO* and {^udWj reverse .pjkner .s'.I:a<; 
tXCAACCX.'CJtlA'rrCJ^OA'JX/a-a'. The rcnalon p/O- 

Aiiipl/ficfliloa rwciion* (SO »J) cuiiiditiptl s l)N/\ 
firmpl^ H>x P<a< H«fr«r il (f» >4), 20a UAIr. UCir. 
<*GTP. «/i<< 4(10 j\M dlll lV 4 1PM Mica* (/nils Ampi/ 
7ii lf rJN/v potymcxQK, unit AfiiprJTiJc iirncii N.jjiy« 

wn/Hw (UNC), AftgimoI«*.f fuctoi VIII jylmvi/oftd 1J? 
on« <if tlitr foM««-4i»^ ^ V cil<tii |**<Hkv< f KJu nM rorliK 

nCtrrrCTAXIRA^ J' « u <i p-nviln vwbc 5' (TAM)atc;ux:- 
XC]'AMiU)C0CCr^TCC:CATC,^3' «lirrr p ln<ll<ii« 
plmflfUvo^Al^n una X In^coici a linker jnu nudcotUlo 
J1«»<Uom Uilv* 'wv MScaiAmp Optical 'W&Ki (pAtl AUm- 

Wr iowii <»3.i # i>btW« niuu,'^ liiai tvow frouMl (Kt iVrtiin 
Rtfuvi-) ^xi.l Kg Iii (mm /edecttn^* *J\ibc cap*, wore 
£lniiUv 1« MU««oAmp Oupj jpr«(n|Iv dcal^aC<t iO p«v 
>«f«i li^lu swKvpuf. AH <.l ilrr- KUlaiiMuinnliU^ vycto 
r lu*l 1^ W; A^^IUI lUusyjWim <|lo>U«r Cflty, CA1 cJlccpl 
r<icu»r vm ptittjcra, t*1i1v1« wnr jyiitfifjlyru ai Cciivm 
inc. (iouOi ? ( m fVttiicbcu, CAK I»taMics -iix- doJyntd 
utli^g Olijja ^-0 .^rivvafv, MIcnvJilE tfllilt'lfiiqti gilil- 

KC$(*a in mc MwU-| 77«) .If^ucrtcc lmwur liijuiuiiivnl 
fnuittial, HrK-nr* prouc 1*^ sJiihiUI hr ill Jcajl iHchrr 
mini cur Aiiuvutli^ triuiA'Mturt; airtl tlurl/u: <»»<'fm(il cf- . 
rliii^; prtin^ iliovM hoi A«hm ai^jK 4 iiupl«t.r with liw 

VUC/HJaI »yc"H^ CUiidiilun* lnrlut\u«3 i.iuln ivl 
5U*C and JO mill ut 95 r C 71i«nirjl tyring pw<n*i3rd Mf/i»i 



reactions wprr iietfonneci in ihi« Mfwa«( 7 70(1 Sequence IV- . 
l«K»f n«r. AOPHCd muuyUi-uiO, «#h»rh ohvliUx < C<^>^ 
AJufi IM^H Syrt<mi P«T>. motion caiullliat.* w«-*y. pvn, 
ftmafUtttl tfli 4 i*tfw«if !^Ucm1u<h yiOCl (Apnlc (Xmi|tiifar, 

Sanva uara, t:A> unken dimply m iru? Motk-J V7rm Si*. 

quojitx* IVtdoctor. Aii^ly«U 4tf daU w»« atcu |wWfirm*-d «mi 
Uur V4-«-lnU«*^ «omyjMtvr. {*<*i\*r\\mt aiiJ «unly«U voffw^rc 



Transection of Cells whK r»<eor VIH G»iurirucl 

J*ilUTT175 flajVJ Of MX CCUS OltVtOa. 1X73), ? humsn 
ftial Kkll^cy fUflH'nwon c«n lllie, wtn- ginwn (o RlVK. can- 
Miicncy on»1 trantfCCffd pIVni CX'Hs wort* wnwn Ui Oiu. 
KilMwliij inodW MAMiiHi uutnauf CUT, .W. 
lJucoic J HiJUcctMi'ii in^i-'Jcd Ka^ic medium (OMHM) With. 
<un fiTycinc w(0i flodlum bioiTbdfiair, icjot l^ul imvlnr 
^runt, X him i.-^JuUimjK, and 1W» p€nid/JiiH«fcp«omy- 

tlflX "Jlic m<idU Mai d«uf*^fr<l JO mLi h*c-f <— ^— 1 1 « <- lr«>«cf«« 
lion. pl^U'rW PNA omouuio of 40, 4, (U* t <ind OJ ja* w V a^ 
;idf»0fi in 1.6 ml of a Bohu!u»i <jorU«lnln* 0,1 is « 
«nd 1 x UUl'liS. Tho fowi n*i«»un* w«re k«ft ai mom I em-' 
fJtt/aUm- f<r- 1U rt«l«> imi4 ihvtt i*«J«U-«l H-«.,«—| M . U«c i>aJ! 3 , 
'11 ic n-kkk -vv/...*.Jj*lcJ «(. 37»C and r «C (*X> 2 f«r 24- hr, 
•wiulscd wUh !HIK. «".tl ^u^p^dod In PUS. Knus 

TTncrrd immediately utotfi UivQJAufnp K» (QUg^n, 

Omt^mntif. <.*A>. I>NA c-.K^cd lain 300 r l fc »l 20 m«i- 
Tf4i.Ua u( pi 1 HjO. 
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methods. Peptides AENKor AEQKwcrc dissolved in water, made Uownic with 
Nad and diluted inco RPM1 growth medium. T-eett-proUferaoon w»y* were 
done e«cnujlry as described 1011 . Briefly, after inrigen pulsing pOii-gmT 
TTCn with cecnpeptides (l-ZmgmT 1 ). PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% giutaraldehydc Glycine was add*d 
co a ami concentration of O.lM and the cells were washed five times in KPM1 
1640 medium containing 1% PCS before co-culture with T-ccU clones in 
round-boaom 96-well microtia* plates. After 4ft h. the cultures were pulsed 
wich i p.Ci of 'H-thymidinc and harvested for scintillation counting 16 h later. 
Predigesuon of native TTCF was done by incubating 200 h-6 TTCF with 0.25 jxg 
pig kidney leguenain in 500 u4 50 mM citrate buffo; pH SS t for I h at 37 "C 
Glycopoptlde digestion*, the peptides HiDNEEDl. HlPN(N-gluco^minc) 
EEDI and HlOKHSDl. which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK) . which a based on human transferrin, 
were obtained by custom synthesis. The three C-icrminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopcptide 
QQQHUGSNVTDCSGNFCLFR was prepared by trypcic (Promega) digestion 
of 5 rag reduced, carboxy- methylated human transferrin followed by 
concanavalin A chromatography". Glycopcptides corresponding to. residues 
602-642 and d2 1-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-tcrminal sequencing The lyophilizcd transferrin- 
derived peptides were ccdissoWed in 50 mM sodium acetate. pH 5.5. 10 mM 
dithioihreitol 20% methanol. Digestion were performed for 3 h at 30 - C with 
5-50 mUml' 1 pig kidney legumain or B-cell AEP. Products were analysed by 
HPLC or MALDf-TOP mass spectrometry using a matrix of lOmgmr 1 o> 
cyanocinruouc acid in 50% acctonitrile/0-1% TfA and a PcrSeptivc Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal itiafa* 



dizarion was obtained with a matrix ion of 568.13 mass units 

Kecked W S4>«mbtn 3 Nc^onbcr >99ft. 
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Fas Ugund (PastrU^roduced by activated T cells and natural | 
killer cells ai>d it Induces apoptosis (prograinmcd cell death) in > 
target celirthroilgh the death receptor Fis/Apol/CD95 (rcf. 1). 
One.impbrtont role of Fast and Fas U to mediate immune- 
fef&faSsr killing of cells that arc pocennally hannful to the 
^organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
t receptor 3 (DcR3). that binds to Fasl and inhibits FasL-induced 
apoptosis. The DcH3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent irnmune-cytoioxic attack by expres- 
sing a decoy receptor that blocks FasL 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) j$cnt superfamily 1 . Using 
the overlapping sequence, we isolaied a previously unknown full- 
length complementary DNA from hunwn fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members oi 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteinc-rich domains 
(CRDs). Uke one other TNFR homologue. osteoprotegerin (OPG) , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a me mbrane-assco dated, 
molecule. We expressed a recombinant, bistidine-tagged form of 
Ddl3 in rnammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamidc gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 share-; 
sequence identity in particular with OPG (31%) and TNFR. 1 - 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OI>C are conserved; 
however, the carboxy-tcrrrunal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, coloa 
and lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DclO, w C generated 
, a recombinant, Fc-tagged DcR3 protein- We tested binding of 
DcR3-Fc to human 293 cells transfected with mdmdual TNI- 
family Ugands. which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini oi 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 1 (Fig. 2a), but not to cells transfeaed with 
TNF Apo2UTRAlL w , Apo3L/TWEAK 4 \ or OPGL/TRANCfcV 
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raNKL"- 11 (data not shown). DcR3-Fc immunopredpiwed shed 
Fasttransfected 293 cells (fig- 2b) and F"**-^ 
FasL (Fig. 2c). as did the Fc-tagged ectodomaxn of Fas to not 

soluble FasL formed a stable complex (Fig. 2d). Equihbnum 
Z£L indicated that Dc*3-Fc and Fas-Fc bound M. solublj 
FasL with a comparable affinity <X = 0.8 S 0 2 and 
1 1 ±0.1nM. respeaivelyi Fig- 2e), and that DcR3-Fc _could 
Mock nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset) Thus. DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL actraty, we 
tested the effect of DcR3-Fe on apoptosis induction by soluble 
FasL in Jurl«t T leukaemia cells, which express Fas (Fig. 3a). DeR3- 
Fc and Fas-Fc blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inh&.uon , « 
-0 1 agml-. Time-course analysis showed that the uldrt«n * d 
not merely delay cell death, but rather persisted for at least 24 hours 
?Re 3b) We also tested the effect of DcR3-Fc on aet.vat.on- 
Lxduccd "cell death (AICD) of mature T lymphocyte* i iFasL- 
dependent process'. Consistent with prevtous results \ ampnon 
of interleukin-2-sdmula«d CD4-posiuve T cells with anu-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
Wocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



ajc l v l a t p a L u s 4 v| 

— — CRD1 



induction of apoptosis to a similar extent. Thus. DcR3 binding 
blocks apoptosis induction by FasL ... c • 

FasL-induced apoptosis U important in damnation of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes: an alternative mechanism involves perforin and 
Branrymes' "-". Peripheral blood natural killer cclU 
marked cell death in Jurkat T leukaemia cells (Eg. 3d); DcR3-Ec^ 
and Fas-Fc each reduced killing of target cells from -6%k ^» 
-30%. with half-maximal inhibition at -1 ngml ; ^rts^uaU 
killing was probably mediated by the perforin/grawirme^w 
Thus. DcR3 binding blocks FasL-dcpendent natu^ TcjUer ceU 
activiry. Higher DcR3-Fc and Fas-Fc concentrations vfere required 
to block natural killer cell activity compared vith-those required to 
block soluble FasL activity, which is cona?ent ir w<th the greater 
potency of membrane-associated FasL compared with soluble 

FasL". ,*w V" .... , 

Given the role of immun^cftotoxic cells in ehm.nanon of 
tumour cells and the fact A.^DoR3 can act as an mkbtwr of 
FasL, we proposed that DtB3*xpression ought contribute to the 
ability of some rumoursWo"«cape immune-cywtonc attack. As 
Benomic amplification frequently contributes to tumongenesis, we 
Investigated wnft»4u DcR3 gene is amphhed in cancer. We 
analysed D.CR3 gene-copy number by quantitative polymerase chain 
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F, S ure 1 Primary struetore and expreseion of human OeM. e. Alignment o the 
amino-acld sequences ot Ocfl3 end of ostcoprotegerin (OPGt the C-«erm,nei 101 
residues of OPG sre net shown. The putative signal cleavage she (arrow), the 
cys woe-rich domeins [CBO l -4). snd the/v4inke<J glycosylate* she (asterwk) are 
snown. b expression of OeP.3 mftNA. Modern hytjrtdttetion snaiysis was done 
uS lng <M 0cfi3 eONA as a probe and Woo of peVM" «NA (Clontech) from 
human fami and eduit dscuas cr eencer eeli (nee. PBL peripheral blooc 
lymphocyte. 

Njn« O M«en>«ljn Pue<lih«« Ua 1»«a 



FI 8U « 2 interaction ot DcFQ with FesL «. 293 caJe were transfeeted^wlth pfl« 
vector (top) or with pRW encoding fuWartfh FasL (bottom), incubated wUh 
0CB3-PC "olid line, sheded sroa). TNFR.-Pc (dotted line) or cuffer com o, 
(dashed line) (the dashed and doned lines ovoriao). end analysed for b.ndlng by 
FACS Statistical anavsis showed a signr^^^^ 
Oioding of OCRS-Fc » cells trandacted with Feel or pRKS. PE. 

cell supematanta were Immuncpreelpltsted with Fc-tagged TNFfll. OcR3 or F*e. 
e Purified soluble FasL(sFasUwes immuncpree.p«o<ed wlthTNr-Bl-Fc. 0cR3- 
Fc or Fes-Fc end visualiwd by immunobiot with ano-FaaL antibody. sFasL wae 
loaded direcdy for ccmperison tn the righH>and lane d. ■**<™#**^ 
locubeBd with OCW-FC or with buffet and n : ».v 8 c by go. Hkrat.oa colu.nn 
rraaions wem araiysed in en essay that de^ 

. end sFssL-Flag. e. Equilibrium binding of DcR3-Fc or Fas-Fc » sFasL-FUg 
Inset, compoddon of 0cR3-Fc wfo, Fas-Fc lor olnd.ng to sFasL-Hag. 
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reaction (PCR)" in gnomic DNA from 35 primary lung and colon 
™ \£L t0 pooled genomic DNA from peripheral blood 
jSfil of 10 hJdv donors. Eight of 18 lung ; tnjM 
Ind 9 of 7 colon tumours showed DcR3 gene ampUficauon. 
"ngmg from 2 to 18 -fold (Fig. 4* b). To confirm dus ~ 
^aSthe colon tumour DNAs with three more, independent sets 
3$S£S?CR primers and probes: we observed nearly the 

tissue sections by in stru hybridization. We detected DcR3 

breast tumours, and 1 out of 1 gastric tumour (data not :shown). A 
SonSough a squamous-cdl carcinoma of the lung is shown ui 
£T DcR^mRNA was localized to infiltrating mahgnan, : 
Uukbut was essentially absent from adjacent stroma, tndicaung 
n^ur pedfioLpression. Although the individual naam tfta- 

^different, the in sir* hybridization results are consistent wtth 
^findm Si *e DcR3 gene is amplified frequently mt— 
SW4M Jlon carcinoma cells, which showed abundant DcR3 
mRNA egression (Fig. lb), also had marked DcR3 gene ^t^' 

hvbridiaation (fivefold) (data not shown). , 

DcR3 ampUfication in cancer is funcaonally relevant, then 
DcB 3 should be ampUfied mote than rteighbourutje geno^uc 
regions that are not important for tumour sumval. To test thi . 



we mapped the human DcR3 gene by ""J™! 

DdS showed linkage to marker AFM21ftxe7 (T160 . which maps to 

ch^sU pSon 20ql3. Next, we isolated from a baetenal 

aruficial chromosome (BAG) library a human genonuc doneriut 

carries DcR3. and sequenced the ends of the done s insert. We ±en 

determined, from the nine colon tumours that showed twofoW I or 

e£S ampUfication of DcR3. the copy number of the DcB* 

CL^quences (teverse and forward) from the *AC jmi of 

S genomic markers that span chromosome 20 

SSed reverse marker showed an average •»j&f&* 

roughly threefold, sUghdy less than the W^Wjfi"™ 

wpUtotion of DcR3; the other 

ampUfication. These data indicate that DdOmay 

centre' of a distal chromosome 20 region shitjs-anipufied n co on 

See, constat with the possibility th>DcR3 ampUficauon 

Wd flmay members; however, this does not rule out the possi- 
Sy that DcR3 ifit^cti^itK other ligands, as do some other 

TNFR family members, including OPG~ - 

FasL is import in regulating the immune response; however, 
little is known abbot how FasL function is controlled. One mechan- 
S TinvSyes me molecule cFLlP. which modulates apoptosis signal- 
um » « d rocC hanism involves proteolytic 
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Figure 3 mhlbition cf FasL activity by DeR3. a. Human iurkat T ■**»""^ 
were incited w«h Fagged soiub.e FasL (sFasU Sng mf, o.JomedKd 
wim ana-FUg anybody (0., *,mr<) In the presence of the proposed 
OcR3-Fc Fas-Fc or human igGi end assayed fcr apoptosis (mean - s .«n. of 
.ripiieaiealbjuAetealls were incubated with eFasl-Rac «*■ ***** ^ b< ** 

numan l 9 Gl (mangles), and spoptosle was <«M i*«M ^SJJJ 
points e fWphMLMoOd T calls wera Stimulated with PHA and ImeneuWn* 
Lwad by comrol (whhe bars, or e*rC03 anybody 
P hoephate*uff«red salina (P6S). human .gGl Fes-Fc. or ^ * 

five donors), d. Peripheral blood natural killer celle were Incubated w.h Cr 
, 3b eli0d Jurkat ceils in the presence ol OcM-Fc (filled drctes). Fas-Fc (open 
circles) or human IgGl (triangles), and urges-call deatn was detained by 
release of s 'Cf (mean r s.d. for two donors, each in triplicate). 



F. 9 ur.aGenomlcampliP<a<ionofDcR^ 

el 8 madenc«arcinom aS (c.d.f. 8 .h.i.k.r).sevan«uamouS<e.lC3r^om a s(.^ 

P. d). one ^ carcinoma (b). one 
one bronch,a. adenocarcinoma (I). The data are means = S.d. ot Z 
done in duplicate, b. Colon tumoura. comprint, ,7 adenocarcinomas. Date are 
In = a.m. of ft. experiments done in du«. e. * * 
aniria of OcR3 mRNA expression ,n e equamous-eell caremome of me lung. A 
rSamaUve bright-hald ,mage (.eft) and the correspond^ derk-ne.d -mage 
W ho show DcA3 mRNA over InhhraCng mel.gr.sn, eplthekum (arrowh^da^ 
Seem non-malignant stroma (S,. blocd vecse. M and neemde tumour.,,,e 
,N are also shown, d. Average ampliation or OcB3 compered , amp. B c* 
S, J neighbouring genomic regions (reverse end (o-ward. Rev and ^-ft*. 
0cR3-nnked marker T160. and other chromoaoma-20 marvera. In me nine colon 

experimen (9 done In duplicate. A5ter.sk inolcstei P < 0.01 tor 
comparing each marker with 0cR3. 
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IgG. blocked with 2% BSAin PBS. DcR3-Fc or fas-Pc w« added, followed by 
serially diluted Fhg-u 6 6<d soluble FasL. Bound ligand wis detected with anti- 
Flag antibody as above. In ihe cornpecirion amy, Fw-Fc was immobilued is 
above, and the weUs were blocked with excess IgGl before addition of FUg- 
ugged soluble FasL plus DcR3-Fc. 

T-cell A1CD. CD3 T lymphocytes were isolated from peripheral blood of 
individual donors using anu-CD3 magnetic beads (MiUenyi Biotech), 
stimulated with phywhaenwgglutinin (PHA; I *g aJ ') for 24 h. and cj^U«^ * . 
in chc presence of interleukin-Z (100 U mf) for 5 days. Ihc eelk ^ gurii*^ 
wells coated wiih anti-CD3 antibody (Phaiminf.en) and analysed <Sr a^o^ojfts 
16 h Uier by FaCS analysis of annexm-V-binding of CO** cdjb^. r f_f 
Natural killer cell actfviry. Natural killer cells were isolated 6bm peripheral 
blood of individuol donors using anti-CDS* wigficpc beads (MUtcayi 
Biotech), and incubated for I6h with "Cr- loaded ij^ceus at an effector- 
to-iirgct ratio of 1:1 in the presence of D£U-PcJf«-fc or human IgGl. 
Target-cell death was determined by rdeasc'o^'Cr in effector- target co- 
cultures relative to release of 5l Cr by degrjam lysis of equal numbers of furkat 
cells. 

G«n©-BmpHncatlon analys1s/$urgicil specimeos were provided by J. Kern 
(lung tumours) and P. Qutf£.(c*lxin tumours). Genomic DNA was extracted 
(Qiagen) and the coticentration was determined using Hoeciut dye 33254 
intercalation fluocomcaje Amplification was determined by quantitative PGR" 
' usmgaTaqMt{n1nWmenc(ABI).Tne method wat validated by comparison of 
PGR and Southern hybridization data for the Myc and HER-2 oncogenes (data 
nof sho^wn^ VGenc-specinc primers and fluorogcruV. probes were designed on 
> r -h Bv inc basis of the sequence of DcR3 or of nearby regions identified on a BaC 
EST consensus; fetal lung was positive for a product of the expected sue. By alternatively, primers and probe* were based 
hybridization to a PCR-generaced probe ^^.^^^^^ Human Genome Center marker AfM2l3«7 (TWO), which is 
(DNA30942) was idenrined. When searching for pocenual dterrunvdy sphced^ Vm ^untom numa ^ ^ u _ 

forms of DcJO that might encode a innsmembrane 
more clones; the coding regions of these clones were identical ut-sac tahai of 



FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interlcukin-l has been 
described 21 . In addition, two decoy receptors that belong w the 
TNFR kmily. DclU and DclU, regulate the Fasl-related apoptosis- 
inducing molecule Apo2l". Unlike DcRl and DcR2. which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TOTR-fc^ 'J* 
0?G\ which shares greater sequence homology with DcR3 (3 iToj 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L". Thus. DcR3 and OPG define a new subset of 
TNFR-fcmily members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNr-fanuly UganOs, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, producnon of a soluble 
decoy receptor for FasL may contribute to immune evasion by 
certain tumours. 

Methods 

Isolation of OcR3 cDNA. Several overlapping EST* in GenB ^(accession 
numbers AA02S672. AA025«73 and W67S60) and in larcseo, (Incyte 
Pharmaceuticals: accession numbers 1339238, 1533571. 1533650. 1542861. 
1769372 and 2207027} showed similarity to members of the TNfR family. We 
screened human cDNA libraries by PCR with primers based on che region of 
EST consensus; fetal lung was positive for a product of the expected size. By 



the initial done (data ooc shown). 



Fc-tusloo proteins (Immunosdhoslna). The entire DdW sequence, or the 
ertodomain of Fas or TNfRl. was fused to the Wnge&n'd Ft region of human 
IgGl, expressed in insect SF9 cells or in hurnaS^WcUl, and purified as 



described 13 



Fluoreaconco-actJvatad cell sorting (FACS) analyais. We trans fected 293 
ceils using calcium phosphate or pfectctwlQiagcn) w«h pRKS vector or pRK5 
encoding full-length human Fas^ (2 jig), together with pRKS encoding CrmA 
(2h-6) » P"™ 1 C * U ^di! After 16 h. the cells were incubated with 
biotinyUted DcRJ-Fc or T^FRl-Fc and (hen wth phycoerythrin -conjugated 
streptavidin (GibcpBiailil'and were assayed by FaCS. The dau were analysed by 
Kolmogomx T Smirnov , 'statistical analysis There was some detectable staining 
of vcctQC-UMUfedd cells by DclU-fc; aa these cells express lioie fast (data 
not shown)";!* is possn>le thai DcIO recognized some other factor that is 
cxpt'esscd coiwritutively on 293 cells. 

ImmunoprocipiUtlon. Human 293 cells were transacted as above, and 
metaboUcally labelled with ( 5i S]cystcinc and ( >J S] methionine (0.5 md; 
Amersham). After I6h of culture in the presence of z-VAD-fmk (lOjtM). 
the medium was immunoprecipitated with DcR3-Fc. Fas-Fc or TWRl-Fc 
(5 u>g), fbUowed by protein A-Scphamsc (Repligen). The precipitates were 
resolved by SDS-PACE and visualized on a phosphorimagcr (Fuji BAS2000). 
Alternatively, purified, flas-taggcd soluble FasL (1 n-g) WM ^cubatcd 

with each Fcfusion protein (1 ug). precipitated with protein A-Scpharosc, 
resolved by SDS-PAGE and visualized by iramunoblotring with, rabbit and- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25 o-e) was 
incubated with buffer or with DcR3-Fc (40 ug) for 1.5 h at 24 X- The reiction 
was loaded onto a 5uperdcx 200 HR 10/30 column (Pharmacia) and developed 
with PBS: 0-6-ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 yU aliquocs into microtiue 
wells prccoated with anti-human IgG (Boehringer) to capture DcR3-Fc 
followed by detection with biotinyUted anti-Fag antibody Bio M2 (Kodak! and 
sirepcavidin-horscradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complexof 420K (data not 
shown), which iJ consistent with a sioichiomctry of two DcIU-Fc horaodimcrs 
to two soluble FasL homoarimcrs. 
S equilibrium binding analysis. Microtiue w c Us were coated with anti-human 



linked to DcR3 (likelihood score = 5.4), SHGCX>626« (T159). the nearest 
available marker which maps to -500 kiiobases from T»60, and five extra 
markers chat span chromosome 20. The DcR3-specific primer sequences were 
5'-OTCTTCCCGCACGCTG-3' and S'-aTCaCGCCGGCaCCaG-3" and the 
auorogeoic probe sequence was 5' -(f aM-ACaCG ATGCGTGCTCCAAGCaG 
AAp-(TAMARA). where FAM is 5'-fluorescein phosphoramiditc. Relative 
gene-copy numbers were derived using the formula 2 (aCT \ where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DMA compared to test DNA. 

Rocciv«<J 24 Scp«ml>cf: t capuJ S Movoribcf l«fl. ^ 
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Crystal structure of the 
ATP-binding subunit 
of an ABC transporter 

U-Wel Hung*, Iris Xiaoyan Wangt, Kishlko Nlkaidot,, 
PoWi Uuf. ©ovanna Perro-Luzri Amest & Sung-Hou Kan-* 

• £ O. luwrence Berkeley National Laboratory, t 

CcB ffefaj* W * D**«"«« ofCh<m«ry. Vnb.cnte, ofCahfbnua a, Berkeley 
Berkeley, California 9*720. USA 

ABC inuuponc» (^o known a, traffic ATPases) form a large 
family of proteins responsible for the translocaaon of a variety 
oTXounds across membranes of both prokaryotes and 
eukaryotes'. The recently completed Eschcnchta col, ~ 
sequence revealed that the largest family of paral 
p^teins is composed of ABC transport^- Mm^riryoac 
Steins of medical significance belong to -^W*^ J 
the cystic fibrosis transmembrane condnctanccxegulator (CTTR), 
the P-glycoprotein (or muludrag-resistancc^rotem) and the 
heterodimeric transporter associoted^th-anhgen processus 
(Tapl-Tap2). Here we report the crystaL'.sjructure at 1.5 A resolu- 
tioii of HbP, the ATP-binding suW of the hiatid^e permease 
which is an ABC transporter from ^fmonellfl typh.mur.um. We 
correlate the details of ^structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins, lie structure provides a basis for upstanding 
properties of ABC transporters and of defective CFTR. proteins. 

ABC ttarispoW contain four structural domains: two nudeo- 
tide-bindlhg; domains (NBDs), which arc highly- J"*"** 
throughout the family, and two transmembrane domains Xn 
WwUes these domains are often separate subumts which are 
Lembled into a membrane-bound complex in eukaryotes *he 
domains arc genoally fused into a single polypeptide chain. The 
periplasm* histidine permease of S. typhmunum »«» d£ c f » * 
wett-characterized ABC transporter that is a good ~^.JjJ« 
supcrfamUy. It consists of a membrane-bound complex. "HigMftj 
which comprises integral membrane subumts, HbQ and-H^M. and 
Z copies P of HisP the ATP-binding subunit. HisP .which ,hj 
properties intermediate between those of integral and peripheral 
Lmbrane proteins'. is accessible fromboth sides o the. membrane 
presumably by its interaction with HisQ and HtsM'- Thetwo HisP 
wbunits form a dimer. as shown by their cooperauv,ty. m ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
acdvit/. and the formation ofa HisP dimer upon chemical. cross. 
Long Soluble HisP ako forms a dimer'. HisP has beer i punfied 
and characterised in an active soluble form 1 whidi can be recon- 
stituted into a fully active membrane-bound comply. 

The overall sliapc of the crystal structure of the HisP ™° om « 15 
that of an T with two thick arms (ami I and arm 111: ****** 
binding pocket is new the end of arm I (Fig. 1). A six-stranded fi- 
sheet (P3 and fiS-p 12) spans both arms of the L. with a domain of a 
a - plus 3-typrstructure 02. P4-&7. al and o2 on ones£ 
(within arm 1) and a domain of mostly o-hehces (u3-«9) on the 




Yl« H19 




K9ure , Cysta! structure of HisP. a Via* of me dimer along an ax.a 
parpandlcu.ertoltstwC-fc.daxiS.Tne topand bottomotlhe dimer era suggested 
„ L towards the perlo.ssmlc and Cy^plasnHc side., respect, (see ,axtl 
Tha thickness of arm II is about ZSA. comparade to mat of membrane. <.-Ha.,cas 

HiaP dlmar showing *e retailve displawmont of the monomer not oppar<«,t ,n 

a Thep.suandeatthedimarlntarfaceara.aceltad "• v ^ 0,0 ^ m0 ~'^ 

the bottom o» arm .. aa shown in a. to^rda arm II. ^ 

oockat a-c The protein and me bound ATP ore n 'nbben and ball-and-stV-k 

re pre S eniadons. raspcctM^. <ey residuaa discussed .n tha text are indica-ad m 

7£L hflures were p^pared with K amino tarmlnu,: C. C 

lermlnua. 
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movpt APPROACH TO QUANTITATIVE POLYMERASE CHAIN REACTION USING 

^^J^nON TO THE DETECTION OF GENE 
AMPLIFICATION IN BREAST CANCER 

Iv8n BlicHE „ MsrtJneOuv... Mnrie-Hcleuc ChaMPEms'. Dominique Vtoi*. **« LidsrHau' and Michel Vfatf** 
^LaboratoiredVncogenctique. Centre Rene Huguemn. St-Cloud. France 



Gene amplification i s a common event in the pr ogres »lon of 
human cancers, and amplified oncogenes have ^"^^ 
have diagnostic, prognostic and UierapcUtic relevance. A 
Icmelic quantitative polymerase-chain-reacuon (PCR) method, 
oalod on fluorescent TaqMan methodology and a new .nstru- 
m^(AB Prism 7700 Sequence Detection System) capable 
3 mUtrhS Tn^or«cence q in real-time, wa, used 
oeno amplification in tumor DNA- Reactions are character- 
?«d VyThe potnt during cycling when PCR amplification ,s rt.H 
n the exponential ph«e. rather than the amount of PCR 
producracc^mulated after a fixed number ofcyc.es. None o 
&e reaction components is limitod during the «*P?"«™f 1 
X£ -oa^"9 that values are highly ™P^"^le ,n reac- 
tions starting with the same copy number. ™s g r ««y 
■ZJLS^t the orecision of ON A quantification. Moreover. 
££££ PCR does hot require pSst-PCR sample handling. 
ihe^bTpreventing potentUI PCR-product carry-over con- 
£m?n«lon: it pos£*ses a wide dynamic 
tion and results in much fa-.ter and higher sample t^u*P>£ 
The real-time PCR method, was used to develop and validate 
a sfmple and rapid assay for the detection and S"™*""^ 
of the 3 most frequently amplified genes ccndl and 

a D02) in breast tumors. Extra copies of "W*^™*^ 
were observed In 10. 23 and 1S%. respectively, of 108 breast- 
Tumor DNA; the largest observed numbers of gene cop.es 
weTe 4 M8 6 and IS 1. respectively. These results correlated 
, those of Southern blotting. The use^ of this new 

semi-automated technique will make molecular analysis ol 
human c°an^ s .mp.er^nd more -..able and shou.d find 
broad applications In clinical and research settings. Int. L 
Cancer 78:661-666. 1998. 
O I99S miey-Liss. fnc. 

Gene amplification plays an impoitant role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genoituc amplification 
was cytogenetic analysis. Amplification of several chromosome 
SLSTSLltaed either as Lrachromosonul double mmutcs 
(dmins) or as integrated homogeneously stsmmg regions (HSRs . 
arc among the main visible cyio S enet,e abnorrnaht.es »n breast 
rumors. Other techniques such as comparable genomic hybridiza- 
tion (CGH) (Kallioniemi et al. 1 994) have also been used ,n broad 
searches for regions of increased DNA copy numbers m tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Posiuoual cloning efforts are underway to idcnUry 
±e critical gcne<s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include tnyc W2A). 
cendi (llql3), and er6B2 (I7ql2-q21) (for review, sccBiechc and 
Lidcreau. 1995). 

Amplification of the myc. ccndl. and er&B2 Pn*~n~B*«* 
should have clinical relevance in breast cancer since u»deprad«K 
studies have shown that these alterations can be used to ^ .denary 
sub-populations with a worse prognosis^ fBerns « al wi. 
Schuuring er al. 1992; Slamon « al. 1987). Muss etal 1994) 
Rested* that these gene alterations may also b= useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in [«ms oi the fre- 
quency of these altcrattons and their clin.cal value ^""j 
over 500 studies in 10 years have failed to resolve ihe controversy 



surrounding the link suggested by Slamon etal 1987) between 
er6B2 amplification and disease progression. The* discrepancies 
are pardydue to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific ccnes (DNA) were initially quantified in tumor cells by 
means of bloning procedures such as Souths and ^bhxnng. 
These batch techniques require large amounts of DNA (5-10 
ue/rcaction) to yield reliable quantitanve results. Furthermore; 
Sous'carc is required at all sta ges ^ *e procc dures «> 
eencrate blots of sufficient quality for reliable dosage analysis. 
Sly PCR has proven to be a powerful u)ol for quanatanvc 
DNA analysis, especially with minimal slatting quashes of *mor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product cither after a given number of cycle, ; (end-point quanbta- 
dvc PCR) or after a varying number of cycles dunng the 
c^pon^ual phase (kinetic quantitative PCR). In the first case an 
tatcmal standard distinct from the target molecule ,s required to 
ascertain PCR efficiency. The method is rclat, vely easy but _impl.es 
generating, quantifying and storing an internal standard for each 
ge^e smdicd. Nevertheless, it is the most frequently applied 
method to date. , 

One of the major advantages of the *™? c ™*f"*^Z 
in quantifying a new gene, since no internal standard .s required (an 
extlwtl standard curve is sufficient). Moreover, the kmeuc ; mrfbod 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples diffenn S * their copy nuinber. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used: It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each jUiouot, the amo- 
tion product. Interest in the kinetic method has been scunulared by a 
aovel approach using fluorescent TaqMan methodology an U new 
instrument (AB1 Prism 7700 Sequence Detection System) capable 
of mcasurine fluorescence in real time (Gibson et al. 1996; He.d « 
al ml ySZ TaqMan reacrion is based on d,e 5' nuclease assay 
fet described by Holland « ol (1991). The latter uses the 5 
SScase activity of Taq oolyme^e to cleave a s^fic fl ^orogcuic 
oligonucleotide probe during the extension phase of FCB- The 
approach uses dull-labeled fluorogenic hybndizauoo 
er at 1993). One fluorescent dye, co-valcntly linked to the 5 end 
of the oligonucleotide, serves as a reporter [FAM (ue. 6<«boxy- 
fluoreScein)l and its emission spectrum is qucnclied by a second 
fluorescent dye. TAMRA (i.e.. 6 ^ hQ ^^t°^K 
attached to the 3' end. During Ac extension phase of the PCR 
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cycle the fluorescent hybridation probe is ^"J**" b **f 
¥r nuclcolytic activity of DNA polymerase gd^^ 

e^s^ 

emission, rc»uiu« e u» -Arw ,, l1 ;-.j u v dividing the emission 
Thr- fluorescence sienal *s normalizca oy aivwuig 

iSSTbSfa » oJST* ratio defined as the Rn (normal**! 

The real-time PCR method offers several ^f^^fSe 
current quantitative PCR methods (Cell « aL 1994) u 0 
oX-bascd homogeneous assay provides a rcal-urae method for 
de«crin« only specific amplification products, since specific hybri- 
dtrion of bo* the primers and me probe is necessary to .genera* a 
S li) the C, (Areshotd cycle) value used for qu»™fi c $i n " 
measured when PCR amplification is still in the log phase of PCR 
pro<ho« accumulation. This is the mam reason why C, *a more 
S measure of the starting copy number than are ™»-pom 
me^uremenis, in which a slight difference in a limiting .con ponent 
TrTh^a drastic effect on U,e amount of product- M use of C, 
rivefTwider dynamic range (at least i lorders of map*- 
mdS reducing the need for serial dilution; H The "^JgJ 

amiable just one minute after thermal cycling 16 complete. W ffie 
Sample dTu S hput of the method is high, since 96 react.ons can be 
analyzed in 2 hi\ 
Here, we applied this semi-automated procedure to deternune 

samples. 

MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 108 primary breast tumoc iremoved 
surgically from patients at the Centre Rene Hugucnuv none of the 
Srieno had undergone radiotherapy or 

U after surgery, the ^S&SjS&^S^ 
nitrogen until extraction of mgh-molecuUr-wctgnt ^ 
were included in this study if the tumor sample used, for DNA 
prep Jation contained more than 60% of tumor ee ls (hutolog^ 
Lalysis). A blood sample wits also taken from 18 of the same 

Pa DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 



of extensive degradation) are both difficult to assess. Wo ^crtfore 
also quantified a control gene (alb) mapping ur chro ™^^?™ 
4ol l-ql3. in which no genetic alterations have been found m 
breast-tumor DNAby means of CGH (Kalliomcm, « aL. 1994). 

Thus. die ratio of the copy number of the target gene to me copy 
number of the alb gene normalizes the amount 
Genomic DNA. The ratio defining the level of amplication is 
termed "N", and is determined as follows: 



N = 



copy nu mber of target gene (opp. eandl. erbBl) 
copy number of reference gene {alb) 



ReaMme PCR , . . . ■ . , 

Theoretical basis. Reactions are characterized by the pom 
durine cTcUnc when amplification of the PCR product is first 
S 7SL foTby the amount of PCR product accumutoed 
£ afixed number of cycles. The higher the Starting copy number 
of^e genomic DNA target, the earlier * significant * crease ,n 
fluorescence is observed The parameter Q (threshold cycle) ts 
SET* rraaiona. cycle number at which the ^sc^ce 
rr^^rAtcd bv clcavaee of the probe passes a fixed threshold. aoovc 

CSS 

gnomic DNA (based on optical density) and its quality («.*.. Uek 



Primers probes, reference human genomic DNA and PCR 
consul Pnmcrs and probes were chosen with the assatance 
7L computer programs Oligo 4.0 (^"oaa B^enees. Ply- 
mouth. MN). EuGene (Daniben Systems. Ctncinnau. OH) and Pnmer 
Express (PeWElmer Applied Biosystems. Foster Cry. CA) 

Primers were purchased from DNAgency (Malvern. PA) and 
probes from Perkin-Elmcr Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybndmuon 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, MicroAmp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Wa^-cwve construction. The kinetic memod requires a 
stanlrd curve. The lancr was construct^ with senal d, uuo„ s of 
soecific PCR products, according to Piattk f «L ( 993). In 
prLuce, each specific PCR product was obtained by amphfytng20 
standard human gnomic DNA (Boehrmgcr.Manr.heim. 
Germ^y) with the same primer pairs as those used later for 
S quantitative PCR The 5 PCR products were ponfied 
ustna MicroSpin S-400 HR columns (Pharmacia. Uppsala. Swe- 
S eSphorezed through an acrylamide gcUnd stamed wnh 
Sum bromide to check their quality. Tie PCR ****** 
then quantified spectrophotcimcmcal ly and pooled and senaUy 
diluted 10-fold in mouse genomic DNA (Uontech, ^Polo • AUo. CA) 
S n constant concentration of 2 ng/pl. Tho sumdard curve use fo 
real-time quantitative PCR was based on sens! ddul.ons of the pool 
ofpCR products rangins from 10"' (10» copies of each gene)to 
10-» (101 CO pi eS ). This scries of diluted PCF products was 
aliquoted and stored at -S0°C until use. 

The standard curve was validated by analyzing 2 known 
quanrities of calibrator human genomic DNA(20 ng and 50 ng). 

PCR omplijication. Amplification mixes (50 I containe^ I the 
sample DNA(around20 ng, around ™W?*«™ e *fi& 
10X TaqMan buffer (5 ul). 200 ^ff^Z^S^of 
uM dUTP 5 mM MgCl,, 1.25 units of AinphTaq Cold, 0.5 units ol 
A^pSTe uSil NSycosylase (UNG) 200 .uM each ^pnmerand 
Too nM probe. The thermal cycling conthttons f^P" 4 ^SSS 
50-C and 1 0 min at 95'C. Thermal cycling consisted of 40 cycles^ 
95'C for IS s and 65'C for 1 min. Each aisay included a standard 
curve (from I0» to 10» copies) in duplicate, a no-template comix 
20 ng S» ng of calibrator human genomic f^fSSLt 
dplicatc. and Ibout 20 ng of unknown t^™^™^ 
§6 samples can thus be analyzed, on a 96-weUj 
samples with a coefficient of variation (CV) h.gher than 10/. were 

AU reactions were performed in the ABI Prism 7700 Sequence 
DetecrioT System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogentc probe dunng PCR 

"Tlvzed automatieally. The software accompanymg d« 7700 
TytL calculates Cand determines .hcMarting copy number in the 
samples. 
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Determine ^ffiP jl 
calculated as described above. Only samples w»ti an n 
higher than 2 were considered to be amplified- 

RESULTS 

and ^Jinyloid precursor protein gene feppl. wruch maps to a 
ana mc p-w»juv» , which no ccnetic alterations have 

chromosome region (2 lq21.2) m wlucti nc ► gcnc» 
been found in breast tumors (Kalhomerru et fl/., r W lap 
atomic gene was the albumin gene (a/6, chromosome 
4qli-ql3). 



Validation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR P"^*^'* 
diluted in genomic mouse DNA at a constat concentrauon of 
2 SX Ss&be noted that the 5 primer pairs el^ o«^ 
le 5 tanrct genes do not amplify genomic mouse DMA (data not 
^UnTngT Uhows the PCR standard curve HJr the 

alb cent : The dynamic range was wide (at leas 4 orders of 
with samples containing a* few as 10* cop.es or as 
many as 10 5 copies. 

Copy-number ratio of the 2 reference ^nes top and albj 

The <w to a/6 copy-number ratio was determined in 18 normal 
teSj^WA sarrfples and all 108 primary breasNCumcr DNA 
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samples. We selected these 2 genes because Aey aw locawd in 2 
chromosome regions {app. 21q21.2; otf- 
obvious genetic changes (including gam.< >J 
observed in breast eancers (K*lHonicrni «c a*. 1994). The ratio for 
the 13 normal leukocyte DNA samples fell between 0.7 and 3 
(mean 1-02 ± 0.21), and was similar for the 108 primary breast- 
E DNA samples (0.6 to 1.6, mean 1.06 ± 0.25). "ataing 
that alb and app are appropriate reference disomw genes for 
breast-tumor DNA. The low range of the ratios also confirmed lhat 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl andabB2 gene dose in normal leukocyte DM 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 18 normal leukocyte DNA samples were tested for 
the gene dose (N). calculated as described in Materia^ and 
Methods 0 . The N value of these samples ranged from 0.. to u 
(mean 0.84 r 0.22) for myc. 0.7 to 1.6 (mean 1.06 r 0 23) for 
ccndl and 0.6 to 1.3 (mean 0.91 4 0-19) for er6B2. Since N values 
for myc ccndl and er6B2 in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 
myc. cend 1 and exbBI gene dose in breast-tumor DM 

myc ccndl and eroB2 gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra cop.es of ccndl ^werc 
more frequent (23%, 25/108) than extra copies of **B2 (15 A 
16/108) and myc (10%. 11/108), and ranged from 2 to 18.6 for 
ccndl. 2 to 15.1 for cr6B2, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was 
amplified 16-fold (T145). 6-fold (T133) and iwn-amplified (Tl 18). 
The 3 genes were never found to be co-amplified in the same^cumor 
erbm and ccndl were co-ernplified in only 3 cases, myc and ccndl 
in 2 cases and myc and er*B2 in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% w _ the 
er6B2 copy number (N < 0.5). suggesting that ihey bore deletions 
of the 17q21 region (the site of <roB2), No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc. ccndl and erbhl amplifications 
had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real-time PGR and 
Southern blot was obtained for tumors with high copy numbers 
(N 2: 5). However, there were cases (I myc 6 ccndl and 4 erbm) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2 .9). 

DISCUSSION 

The clinical applications of gene amplification assays arc 
currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but tins 
method is not sensitive enough to delect low-level gene amplica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TA6U 1 - DISTRIBUTION OF AMPLIFICATION L£VELfN) FOR myc; 
AN0 crbD2 GENES XU 108 HUMAN ORi^ST TUMORS 

Amplification kv€*(N) 



Gene 

myc 

ccndl 

crbBl 



0.3-1.9 



0 
0 

5 (*.6%) 



97 (89.8%) 
S3 (76.9%) 
87 (30.6%) 



11(10.2%) 
17(15.7%) 
8 pA%) 



(1.4%) 
(7.4%) 



reagents and requires relatively large amounts or high-quality 
genomic DNA, which means it cannot be used rounnely m many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DKA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paniffin-embcdded tissues). 

lo this study, wc validated a PCR mclhod developed for the 
quantification of gene over-reprcscnturion in rumors. The meihod\ 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-bascd quantitative assays such as 
compeudve quantitative PCR (CeJi er at. 199-4). First, the feal-time 
PCR method is performed in a closed- tube system, avoiding the 
risk of contamination by amplified products. Rc-amphficatton of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
flLongo ct aL 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable 
Wc found it possible to determine, in triplicate, the number of 
conies of a target gene in more than 100 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain <:qual starting amounts 
of DNA This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Vourth, and above al^ 
real-time PCR makes DNA quantification much more precise and 
reproducible, since . it is based on C ( values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C, to be 
calculated when FCR amplification is still in the exponential phase 
and when none of the reaction components is raie-hmitmg. The 
within-run CV of the Q value for calibrator human DNA (5 
reolicalcs) was always below 5%, and the berween-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing ihe 
G ratio of the target gene with that of reference genes The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-time PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is labonous). 

The only potential disavantagc of real-timi PCR, like all other 
PCR-bascd methods and solid-matrix bloniog techniques (South- 
em blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from fc*nor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistochem- 
istry can measure alterations on a cell-by-ccll basis (Pauletn et aL, 
1996- Siamon et aL 19S9). However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene produci is over-expressed in 
the absence of gene amplification, which w,U be possible in the 
future by real-time quantitative RT-PCR. Immune histochemistry is 
subject to considerable variations in die hands of different teams, 
owing to alterations of target proteins during the procedure the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature, (i) Chromosome regions 4qll-q13 and 21q21.2 
(which bear alb and app. respectively) showed no gcuctjc alter- 
ations in the breast-cancer samples studied liere, in keeping with 
S^sSis of CGH (Kallioniemi et aL W 0<) We found that 
amplifications of these 3 oncogenes were independernt events as 
reported by other teams (Bents et aL 199?.; Berg et al 1992). M 
The frequency and degree of myc amplification in our breast rumor 
DNA scries were lower than those of ccndl and eroB2 amplifica- 
tion, confirming the findings of Borg et aL (1992) and CourjaU/ aL 
(1997) (iv) The maxima of ccndl and crb&2 over-representation 
were 18-fold and 1 5-fold, also in keeping with earlier results (about 
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Cycle 

CONDI 



ALB 



Tumor 


°x e 


opy number 


Ct 


Copy number 


■ T118 


27.3 


4605 


26.5 


4365 


M T133 


23.2 


61659 


25.2 


10092 


m T145 


22.1 


125892 


25.6 


7762 



FiGUbeZ -ecndl end bib gene dosage by PCR. k > -<*« ™ «| I'SS^ffiStfJKS 

, . -.i / v ./ . iqq^ rw results also correlate well «"ith those recently 

30-fold maximum) (Ben* a at. 1 992; Borg « of 1992; Cou^al ct « fS^ltStohSS (1997) who used the TaqMan system to 

at. 1997). (v) The «*B2 copy n-mb« ob*med ««« ^r^S in a small scries of breast tumors 

PCR were in good «~m«>.»>* T.r^bTfebW,^ ?^^fflL»i^(l5-S0ataDi«a«»»P«o^ 
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TAtttC II - EXAMPLES OF ccndl GENE OOSACe RESULTS 
FROM 3 BREAST TUMORS' 



Tumor 




csndJ 












Copy 
numbw ' 


Mcoa 


so 


Copy 
number 


Mean 


so 


TU8 


4525 






4223 




89 






4605 


*603 


77 


4365 


4325 


1,06 




4678 






4387 








T133 


59821 






9787 






6.03 


61659 


61100 


nil 


10092 


10137 


375 




61821 






10533 








T145 


128563 






7321 






16.34 


125892 


125392 


3448 


7762 


7672 


316 




121722 






7933 









'For each sample, 3 replicate experiments wen: performed arid (he mean 
and the standard deviation (SD) was dcterrruned. The level of ccndl gene 
amplification (Nccndl/alb) is decamined by dividing Ac average ccndl 
copy number value by the average alb copy number value 

point measurement of fluorescence intensity. Here we report myc. 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(2:5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and er6B2) observed by means of rcaJ-time 
quantitative PCR as compared with Southem-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tctrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et ai 4 1992; 
Slamonefa/.. 1987). 

Finally, this technique can be applied tn the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbBl (but not of the other 2 proto- 
oncogenes) in several tumors; cr6B2 is localcd in a chromosome 
region (I7q2l) reported to contain both deletions and amplifica- 
tions in breast cancer (Bicchc and Lidcrcau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-ncoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 
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<£irst sequence: pl.DJ : }4804 (length = 52£) 
<second sequence: pl.r itzman (length = 673) 



APPENDIX B 



<597 matches in an overlap of 598: 99.83 percent similarity 
<gaps in first sequence: 1 (75 residues), gaps in second sequence: 0 
<score: 2895 (Dayhoff PAM 250 matrix, gap penalty ='8+4 per residue) 
<endgaps not penalized 

10 20 30 40 50 60 

pi . DNA448 04 MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVF 
************************************************************ 

pi . hoi t zman MCSRVPLLLPLLLLLALGPGVQGCPSGCQ^ 

10 20 30 40 50 60 

70 80 90 
pi . DNA44804 ENGITMLDASS FAGLPGLQLLDLSQNQI AS 



********* ******************** 



pi . holtzman ENGITMLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPl^LSNLDLT^RLHEITNETF 

70 80 90 ~100" " " 110 " "120 



pl.DNA44804 



100 

LRLPRLLLLDLSHNS 

*************** 

pi • holtzman RGLRRLgRLJI^KNRIRHIQPGAFDTLDRLL^ 

130 " 140 " 150 ~160 170 " 180 



110 120 130 140 150 160 

pi . DNA44804 LI^EPGILDTAWEALRI^GLGI^LDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRG 
************************ * * ********************************** 

pi . holtzman LLALEPGILDTANVEALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRG 

190 200 210 220 * 230 240 

170 180 190 200 210 220 

pi . DNA44804 LTRLRIJVGOTRIAQLRPEDLAGLAALQ 

************************************************************ 

pi . holtzman LTRLRLAGNTRIAQLRPEDLAGLAA^ 

250 260 270 280 290 300 

230 240 250 260 270 280 

pi . DNA44804 NCVCPLSWFGPWWESHVTLAipPEETRCHF 

************************************************************ 

pi . holtzman NCVCPLSWFGPWVRESHVTLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPT 

310 320 330 340 350 360 

290 300 310 320 330 340 

pi . DNA44804 TRPVWEPTALSSSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTC 
************************************* * * ********************* 
pi - holtzman TRPVVREPTALSSSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTC 

370 380 390 400 410 420 

350 360 370 380 390 400 

pi . DNA44804 HLGTRHHI^CLCPEGFTGLYCESQ 

************************************************************ 

pi . holtzman HLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGL 

4 30 440 450. 460 470 480 

410 420 430 440 450 460 

pl.DNA44804 QRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRL^ 

************************************************************ 

pi . holtzman QRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPAS 

490 500 510 520 530 540 



470 



480 



490 



500 



510 



520 



pi. DNA4 4804 GRVPEGF' ~ CGEAHTPPAVHSNHAPVTQAREGNLP* ~ % APALAAVLLAALAAVGAAYCVR 
*******/***** I********************** 

pi . holtzman GRVPEGEEACGE^TPPAVHSNHAPVTQAREGNLPLLIAPALAAVLLAALAAVGAAYCVR 

550 560 570 580 590 600 

530 540 550 560 570 580 

pi . DNA44804 RGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPG 
************************************************************ 
pi . holtzman RGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPG 

610 620 630 640 650 660 

590 

pl.DNA44804 PGLQS PLHAKPYI 
************* 

pi. holtzman PGLQS PLHAKPYI 

670 



Sequence file: /home/i y/va/Molbio/carpenda/temp. btie/pl .holtzman 
motifs in /usr/local/seq/libdata/motif .pro 

Motif name: N-glycosylation site. 
Accession: PS00001; 
Motif: N[!P] [ST] [ ! P] 

10 1 NLSN 

117 NETF 

273 NLSL 

500 NLSG 

528 NATY 



Sequence file: /home/: y/va/Molbio/carpenda/temp . ;tie/pl . DNA44804 
motifs in /usr/local/seq/libdata/motif .pro 

Motif name: N-glycosylation site. 
Accession: PS00001; 
Motif: N[!P] [ST] [IP] 

198 NLSL 
425 NLSG 
453 NATY 



HMM file: 
Sequence file: 



/usr/seqdb/pf am/Pf am_ls 
pl.DNA44804 



Query: DNA44804 [598 aa] 



Scores for sequence 



family classification (score includes all domains) 



Model 



Description 



LRR 

LRRCT 

EGF 

LRRNT 

fn3 



Leucine Rich Repeat 

Leucine rich repeat C- terminal domain 
EGF- like domain 

Leucine rich repeat N- terminal domaxn 
Fibronectin type III domain 



Score 


E 


-value 


N 


59.2 


8 


.8e-14 


7 


47.1 




4e-10 


1 


30.0 


5 


.4e-05 


1 


29.8 


6 


.5e-05 


1 


13 .0 




0.15 


1 



Parsed for domains: 

Model Domain seq-f seq-t 



hmm-f hram-t 



score E -value 



LRRNT 


1/1 


23 


51 


LRR 


1/7 


53 


76 


LRR 


2/7 


77 


102 


LRR 


3/7 


118 


141 


LRR 


4/7 


142 


164 


LRR 


5/7 


165 


189 


LRR 


6/7 


190 


212 


LRRCT 


1/1 


223 


275 


EGF 


1/1 


334 


366 


LRR 


7/7 


415 


437 


fn3 


1/1 


383 


474 



1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 



31 [ 
25 [ 
25 [ 
25 [ 
25 [ 
25 [ 
25 [ 
54 [ 
45 [ 
25 [ 
84 [ 



29.8 
5.7 
9.4 
10.4 
19.1 
11.1 
12.3 
47.1 
30.0 
3 . 1 
13 .0 



6 .5e-05 
2.1e+02 
65 
44 
0.1 
26 
12 
4e-10 
5.4e-05 
4.8e+02 
0.15 



HMM file: 
Sequence file: 



) /usr/seqdb/pfara/Pf amis- 
pl.holtzman 



Query: holtzman [673 aa] 

Scores for sequence family classification (score 
Model 



LRR 

LRRCT 

EGF 

LRRNT 

fn3 



Description 
Leucine Rich Repeat 

Leucine rich repeat C- terminal domain 
EGF- like domain 

Leucine rich repeat N-terminal domain 
Fibronectin type III domain 



includes all domains) : 

Score E -value N 



108 
47 
30 
29 
13 



8 
1 
0 

.8 
.0 



le-28 
4e-10 
5.4e-05 
6.5e-05 
0.15 



11 
1 
1 
1 
1 



Parsed 
Model 


for domains : 
Domain seq-f 




LRRNT 


1/1 


23 


51 


LRR 


1/11 


53 


76 


LRR 


2/11 


77 


100 


LRR 


3/11 


101 


124 


LRR 


4/11 


125 


148 


LRR 


5/11 


149 


169 


LRR 


6/11 


170 


192 


LRR 


7/11 


193 


216 


LRR 


8/11 


217 


239 


LRR 


9/11 


240 


264 


LRR 


10/11 


265 


287 


LRRCT 


1/1 


298 


350 


EGF 


1/1 


409 


441 


LRR 


11/11 


490 


512 


fn3 


1/1 


458 


549 



-f 


hmm-t 




score 


E -value 


1 


31 


[] 


29.8 


6.5e-05 


1 


25 


[] 


6.1 


1.9e+02 


1 


25 


[] 


21.6 


0.019 


1 


25 


[] 


15.6 


1.2 


1 


25 


[] 


18.1 


0.21 


1 


25 


[] 


9.7 


58 


1 


25 


[] 


6.1 


1.8e+02 


1 


25 


[] 


10.4 


44 


1 


25 


[] 


19.1 


0.1 


1 


25 


[] 


11.1 


26 


1 


25 


[] 


12.3 


12 


1 


54 


[] 


47.1 


4e-10 


1 


45 


[] 


30 .0 


5.4e-05 


1 


25 


[] 


3.1 


4.8e+02 


1 


84 


[] 


13.0 


0.15 
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with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. • 

3. As part ofthe Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins?. When such a tumor antigen protein is 
identified, one can produce an antibody thatrecognizes and binds to that protein 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. Inthec 0uree . 0 f m ^ 

Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells 
at genomic DNA, mRNA and protein levels. An important example of one such ' 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than m corresponding normal human cells. To date we 
have generated antibodies that bind to about 30 of the tumor antigen proteins ' 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor ' 
antigen proteins in both human cancer cells and corresponding normal cells We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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SECOND DECLARATION OF PAUL POLAKIS. Ph.D 



I, Paul Polalds, Ph.D., declare and say as follows: 

1. I am cuixently employed by Genentech, Inc. where my job title is Staff 
Scientist. 

2. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentecb's Tumor Antigen Project, which is a large research 
project with a primary focus on identifying tumor cell markers ihar find use 
as targets for both the diagnosis and treatment of cancer in humans. 

3. As I stated in ray previous Declaration dated May 7, 2004 (attached as 
Exhibit A), ray laboratory has been employing a variety of techniques, 
including rnicroarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal tissue(s). 

4. In the course of our research using rnicroarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor tissue 
at significantly higher levels than in normal human tissue. To date, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proieins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levels of mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B. In Exhibit B, h V means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means that no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

5. As shown in Exhibit B, of the 3 1 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level , 28 of them (i.e., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed that in the vast 
majority of cases, there is a very strong correlation between increases in 
mRNA expression and increases in the level of protein encoded by that 
mRNA. 
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6. 



7. 



Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4-5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRN A m a tumor 
tissue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative ro 
the normal tissue. In fact, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevated levels of the encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption, 

I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be 
true, and further that these statements were made with the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title IS of the United States 
Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 
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EXHIBIT A 



DECLARATION OF PAUL POLAKIS, Ph.D. 
I, Paul Polakis, Ph.D., declare and say as follows: 

1 . I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both die diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins'*. When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely usefiil for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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HER-2/neu Breast Cancer Predictive Testing 

Julie Sanford Hanna t Ph.D. and Dan Mornin. MD. 



Each year, over 182,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors mosl predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease-free 
survival The gene has been shown to be amplified and/or 
overexpressed in )0%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(IHC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, PathVysion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation between low- versus high-ampiifica- 
tidh. At least one study has demonstrated a difference in 



recurrence risk in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2Meu 
status may be particularly important to establish in women with 
small (< 1 cm) tumor size. 

The choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fluorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates mat HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage-2 
breast cancer patients. 

Selection of patients for Herceptin c (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin 0 in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin 0 in patients with or without gene amplification 
status determined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMCVPAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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Contributed by Patrick O. Brown, August 6, 2002 

Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
report a genome-wide microarray comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation in 
a series of primary human breast tumors. We have profiled DNA 
copy number alteration across 6,691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell lines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene- by-gene) mapping of am pi icon boundaries and 
the quantitative analysis of amplicon shape provide significant 
improvement in the localization of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation in gene copy number contributes to 
variation in gene expression in tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors is directly attributable to underlying variation in gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
some of these regions contain known or candidate oncogenes 
[e.g, FGFR1 (8pll), MYC (8q24), CCND1 (llql3), ERBB2 
(17ql2), and ZNF217 (20ql3)j and tumor suppressor genes 
[RBI (13ql4) and TP53 (17pl3)], the relevant gene(s) within 
other regions (e.g., gain of lq, 8q22, and 17q22-24, and loss of 
8p) remain to be identified. A high-resolution genome- wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrays, we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 
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this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell Lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. The 
fraction of tumor cells within specimens averaged at least 50%. 
Details of individual tumors have been published (8, 9), and 
arc summarized in Table 1, which is published as supporting 
information on the PNAS web site, www.pnas.org. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

DNA Labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in Pollack et ah (7), with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. "Test" DNA 
(from tumors and cell lines) was f luorescently labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene clusters). The 
"reference" (labeled with Cy3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePtx scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles arc 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 
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Fig. 1- Genome-wide measurement of DN A copy number alteration by array CGH. (a) DNA copy number profiles are illustrated for cell Hnes containing different 
numbers of X chromosomes, for breast cancer cell lines, and for breast tumors. Each row represents a different ceil line or tumor, and each column represents 
one of 6,691 different mapped human genes present on the microarray. ordered by genome map position from Ipter through Xqter. Moving average (symmetric 
5-nearest neighbors) fluorescence ratios (test/reference) are depicted using a log^based pseudocolor scale (indicated), such that red luminescence reflects 
fold-amplification, green luminescence reflects fold-deletion, and black indicates no change (gray indicates poorly measured data), (o) Enlarged view of DNA 
copy number profiles across the X chromosome, shown for cell lines containing different numbers of X chromosomes. 



identifying the starting position of the best and longest match of 
any DNA sequence represented in the corresponding UniGene 
cluster (10) against the "Golden Path" genome assembly 
(http://genome.ucsc.edu/; Oct 7, 2000 Freeze). For UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (i.e., reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the "Golden 
Path" (http://genome.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g., 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 



deletion. Parallel analysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. lb)> as we did before 
(7), demonstrated the sensitivity of our method to detect single- 
copy loss (45, XO), and 1.5- (47,XXX), 2- (48,XXXX), or 
2.5-fold (49,XXXXX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For example, gains within lq, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer cell lines/tumors 
(90%/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
ly), as were losses within Ip, 3p, 8p, and 13q (80%/24%, 
80%/22%, 80%/22%, and 70%/18%, respectively), consistent 
with published cytogenetic studies (refs. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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Fig. 2. DNA copy number alteration across chromosome 8 by array CGH. (a) DN A copy number profiles are illustrated for cell lines containing different numbers 
of X chromosomes, for breast cancer cell lines, and for breast tumors. Breast cancer cell lines and tumors are separately ordered by hierarchical clustering to 
highlight recurrent copy number changes. The 241 genes present on the microarrays and mapping to chromosome 8 are ordered by position along the 
chromosome Fluorescence ratios (test/reference) are depicted by a log 2 pseudocolor scale (indicated). Selected genes are indicated with color-coded text (red. 
increased- green, decreased; black, no change; gray, not welt measured) to reflect correspondingly altered mRNA levels (observed in the majority of the subset 
of samples displaying the ONA copy number change). The map positions for genes of interest that are not represented on the mtcroarray are indicated in the 
row above those genes represented on the array, (b) Graphical display of DNA copy number profile for breast cancer cell line SKBR3. Fluorescence ratws 
(tumor/normal) are plotted on a log 2 scale for chromosome 8 genes, ordered along the chromosome. 



number of genomic alterations (gains and losses) was found to 
be significantly higher in breast tumors that were high grade (P = 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative (P = 0.04), and harboring TP53 mutations (P = 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20, can be r ; v: . ;l 
found in Figs. 6 and 7, which are published as supporting : M 
information on the PNAS web site). £ 

For a subset of breast cancer cell lines and tumors (4 and 37, ,^g#rV; 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA I 
microarrays (8). The parallel assessment of mRNA levels is ;rgg:> ; : ; V 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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Fig. 3. Concordance between DNA copy number and gene expression across chromosome 17. DNA copy number alteration {Upper) and mRNA levels {Lower) 
are illustrated for breast cancer cell lines and tumors. Breast cancer cell lines and tumors are separately ordered by hierarchical clustering {Upper), and the 
identical sample order is maintained {Lower). The 354 genes present on the microarrays and mapping to chromosome 1 7, and for which both DNA copy number 
and mRNA levels were determined, are ordered by position along the chromosome; selected genes are indicated in color-coded text (see Fig. 2 legend). 
Fluorescence ratios (test/reference) are depicted by separate iog2 pseudocolor scales (indicated).- 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; i.e., a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Fig. 4a). For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
/ tests comparing adjacent classes: cell lines, 4 x 10 " 49 , 1 x 10" 49 , 
5 x 10" 5 , 1 x 10" 2 ; tumors, 1 X 10" 4 \ 1 x 10' 214 , 5 X 10" 41 , 
1 x 10~ 4 ). A linear regression of the average Iog(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively (Fig. 
4a y regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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Fig, 4. Genome-wide influence of DNA copy number alterations on mRNA levels, (a) For breast cancer cell lines (gray) and tumor samples (black), both 
mean-centered mRNA fluorescence ratio (log* scale) quartiles (box plots indicate 25th, 50th, and 75th percentile) and averages (diamonds; V-value error bars 
indicate standard errors of the mean) are plotted for each of five classes of genes, representing DNA deletion (tumor/normal ratio < 0.8), no change (0.8-1.2), 
low- (1.2-2), medium- (2-4), and high-level (>4) amplification, lvalues for pair-wise Student's t tests, comparing averages between adjacent classes (moving 
lefttoright),are4 x 10~ 4 * 1 x 10' 49 ,5 x 10" s . 1 x 10 " 2 (cell lines), and 1 x 10" 43 , 1 x 10' 21 * 5 x 10' 41 , 1 x 10" 4 (tumors). (6) Distribution of correlations between 
DNA copy number and mRNA levels, for 6,095 different human genes across 37 breast tumor samples, (c) Plot of observed versus expected correlation coefficients. 
The expected values were obtained by randomization of the sample labels in the DNA copy number data set. The line of unity is indicated, (d) Percent variance 
in gene expression (among tumors) directly explained by variation in gene copy number. Percent variance explained (black line) and fraction of data retained 
(gray line) are plotted for different fluorescence intensity /background (a rough surrogate for signal/noise) cutoff values. Fraction of data retained is relative 
to the 1.2 intensity/background cutoff. Details of the linear regression model used to estimate the fraction of variation in gene expression attributable to 
underlying DNA copy number alteration can be found in the supporting information (see Estimating the fraction of Variation in Gene Expression Attributable 
to Underlying DNA Copy Number Alteration). 



tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. 4d). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4d). This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor ceils themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by-gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generaJizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phillips et al. (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et al. (15) recently reported that in metastatic 
colon tumors only —4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et ai (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity amplicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" gene(s) 
within amplicons (16)], on a large number of samples, in addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients' tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the amplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochiometric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 
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Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 
two-dimensional gel electrophoresis/the results showed^ 
that there is a gene dosage effect that in some cases 
superimposes on other regulatory mechanisms. This ef- 
fect depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels. Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in. relatively few cases of well focused 
abundant proteins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomics 1:37-45, 2002. 



Aneuploidy is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2, cyclin d1 t 
emsl, and N-myc (3-5). However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myc protein 
overexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q- f 11p-, 1q+, 
11q13+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) with 
g ene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 



1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC, transitional cell carcinoma; LOH, loss of heterozygosity; 
PA-FAB P, psoriasis-associated fatty acid-binding protein; 2D, 
two-dimensional. 
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Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chr.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. 8, expression of mRNA in invasive tumor 827 compared with the non-invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length of the chromosome 
{left). The bold curve in the ratio profile represents a mean of four chromosomes and is surrounded by thin curves indicating one standard 
deviation. The central vertical line {broken) indicates a ratio value of 1 (no change), and the vertical lines next to it (dotted) indicate a ratio of 
0.5 (left) and 2.0 {right). In chromosomes where the non-invasive tumor 335 used for comparison showed alterations in DNA content, the ratio 
profile of that chromosome is shown to the right of the invasive tumor profile. The colored bars represents one gene each, identified by the 
running numbers above the bars (the name of the gene can be seen at www.MDLDK/sdata.html)- The bars indicate the purported location of 
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart; >2-fold 
increase {black), >2-fold decrease {blue), no significant change (orange). The bar to the far right, entitled Expression shows the resulting change 
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated (black), at least half of the genes 
down-reguJated (blue), or more than half of the genes are unchanged (orange). If a gene was absent in one of the samples and present in 
another, it was regarded as more than a 2-fold change. A 2-fold level was chosen as this corresponded to one standard deviation in a double 
determination of -1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation— Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodium-guanidinium thiocyanate 
solution and stored at -80 °C. Total RNA was isolated using the 
RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A) + RNA was isolated by an oligo(cTT) selection step (Oligotex 
mRNA kit; Qiagen). 

cRNA Preparation— 1 /xg of mRNA was used as starting material. 
The first and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to the manufac- 
turer's instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrtp® in vitro transcription kit (Ambion). Biotin-labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (Qiagen). 

Array Hybridization and Scanning— Array hybridization and scan- 
ning was modified from a previous method (13). 10 /xg of cRNA was 
fragmented at 94 3 C for 35 min in buffer containing 40 mM Tris 
acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 M NaCI, 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 °C for 5 min, 
subsequently cooled to 40 °C, and loaded onto the Affymetrix probe 
array cartridge. The probe array was then incubated for 16 h at 40 °C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes in 6x SSPE-T at 25 °C followed by 4 washes in 0.5x SSPE-T 
at 50 °C. The biotinylated cRNA was stained with a streptavidin- 
phycoerythrin conjugate, 1 0 /xg/ml (Molecular Probes) in 6x SSPE-T 
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for 30 min at 25 °C followed by 10 washes in 6x SSPE-T at 25 °C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope (made for Affymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetrix gene 
expression analysis software. 

Microsatellite Analysis— Microsatellite Analysis was performed as 
described previously (14). Microsatellites were selected by use of 
www.ncbl.nlm.nih.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DNA was extracted 
from tumor and blood and amplified by PCR in a volume of 20 ^ for 35 
cycles. The amplicons were denatured and electrophoresed for 3 h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplicons compared with blood. 

Proteomic Analysis— TCCs were minced into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- 
teins were identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the master two-dimensional gel 
image of human keratinocyte proteins; see biobase.dk/cgi-bin/celis. 

CGH— Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Ruorescein-labeled tumor DNA (200 ng) t Texas Red- 



labeled reference DNA (200 ng), and human Cot-1 DNA (20 jig) were 
denatured at 37 °C for 5 min and applied to denatured normal met- 
aphase slides. Hybridization was at 37 °C for 2 days. After washing, 
the slides were counterstatned with 0.15 jig/mi 4,6-diamidino-2-phe- 
nylindoie in an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using f luorescein-labeled reference DNA 
and Texas Red-labeled tumor DNA {inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions of DNA gains and losses. The average 
green:red fluorescence intensity ratio profiles were calculated using 
four images of each chromosome (eight chromosomes total) with 
normalization of the green:red fluorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-diamidino-2-phenylindole band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization —The CGH analysis 
identified a number of chromosomal .gains and losses in the 
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Table I 

Correlation between alterations detected by CGH and by expression monitoring 



Top, CGH used as independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable (if expression alteration - what CGH deviation was found). 



CGH alterations 


Tumor 733 vs. 335 


Concordance 


CGH alterations 


Tumor 827 vs. 532 


Concordance 


Expression change clusters 


Expression change clusters 


13 Gain 
10 Loss 


10 Up-regulation 

0 Down-regulation 

3 No change 

1 Up-regulation 

5 Down-regulation 

4 No change 


77% 
50% 


10 Gain 
12 Loss 


8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 


80% 
17% 




Tumor 733 vs. 335 


Concordance 




Tumor 827 vs. 532 


Concordance 


Expression change clusters , ijl . 
K CGH alterations 


K * CGH alterations 


1 6 Up-regulation 
21 Down-regulation 
15 No change 


1 1 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

12 No change 
3 Gain 

3 Loss 

9 No change 


69% 
38% 
60% 


17 Up-regulation 
9 Down-regulation 
21 No change 


10 Gain 

5 Loss 

2 No change 

0 Gain 

3 Loss 

6 No change 

1 Gain 
3 Loss 

17 No change 


59% 
33% 
81% 



two invasive tumors (stage pT1 , TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-, and X-, and 7+, 9q-, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24+, 2q14.1-qter~, 3q12-q13.3-, 6q12-q22-, 
9q34+, 11q12-q13+, 17+, and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. 1/4) and 
20q12 in TCC 827 (Fig. 10). 

mRNA Expression in Relation to DNA Copy Number— The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1 ,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non-invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomaJ gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (Table I, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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Fig. 2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 (A) and 733 (♦) and their non-invasive 
counterparts 532 and 335. The expression change was taken from the Expression line to the right in Fig. 1 , which depicts the resulting 
expression change for a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down-regulated 
to be scored as an expression change. All chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. 



ation in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table I, bottom). Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2)CEor both tumors TCC 733 (p < 0.015) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology (Fig. 2)\ Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Microsatellite-based Detection of Minor Areas of Loss- 
es—In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1, TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25~32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1A). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 
and in TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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Fig. 3. Microsatettite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Fig. 1), (b) by D1S2735 close to cathepsin E (gene 
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close 
to general ^-spectrin (gene number 1 1 on Fig. 1) and of (d) tumor 827 
showing loss of heterozygosity at chromosome 18q12 by S18S1 118 
close to mitochondrial 3-oxoacyl-coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (A/), and the lower curves show the 
electropherogram from tumor DNA (7). In all cases one allele is 
partially lost in the tumor amplicon. 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at 18q12 showed LOH {Fig. 3), suggesting that 
transcriptional down-regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels - 
2D- PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fig. 4. Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated {horizontal 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene {vertical axis). mRNAs that were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive tumors (along horizontal axis) or 
as absent in non-invasive reference (fop of figure). Two different 
scalings were used to exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (0O) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
left), phosphoglucomutase 1 , glutathione transferase class jx number 
4, fatty acid-binding protein homologue, cytokeratin 15, and cyto- 
keratin 13; S (from left), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratin 13, and calcyclin; C (from left), a-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-e, and 
pre-mRNA splicing factor, D, mesothelial keratin K7 (type II); E (from 
top), glutathione S-transferase-ir and mesothelial keratin K7 (type II); 
F (from top and left), adenylyl cyclase-associated protein, E-cadherin, 
keratin 19, calgizzarin, phosphoglycerate mutase, annexin IV, cy- 
toskeletal y-actin, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translationally controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na,K- 
ATPase 0-1 subunit; G, (from top and left), TCP20. calgizzarin, 70- 
kDa heat shock protein, calnexin, hnRNP H, cytokeratin 15, ATP 
synthase, keratin 19, triosephosphate isomerase, hnRNP F, liver glyc- 
eraldehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-ir, and keratin 8; H (from left), plasma gelsolin, autoantigen cal- 
reticulin, thioredoxin, and NAD+ -dependent 15 hydroxyprostaglandin 
dehydrogenase; / (from top), prolyl 4-hydroxylase 0-subunit, cyto- 
keratin 20, cytokeratin 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexin II; K, annexin IV; L (from fop and left), 90~kDa heat 
shock protein, prolyl 4-hydroxylase 0-subunit, a-enolase, GRP 78, 
cyclophilin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto-. 
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Fig. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs. The upper part of the figure shows a 2D gel 
{(eft) and the oligonucleotide array {right) of TCC 532. The red rectan- 
gles on the upper get highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC 
827 {red annotation). The tile on the array containing probes for 
cytokeratin 15 is enlarged below the array {red arrow) from TCC 532 
and is compared with TCC 827. The upper row of squares in each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
specific binding). Absence of signal is depicted as black, and the 
higher the signal the lighter the color. A high transcript level was 
detected in TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure {left) show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (invasive), respectively. Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript. A medium transcript level was de- 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were detected in TCC 733 (166 units). IEF, isoelectric focusing. 



keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 



Table II 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration 5 


Protein alteration 


Annexin II 


1q21 


733 


Gain 


Abs to Pres a 


Increase 


Annexin IV 


2p13 


733 


Gain 


3.9-Fold up 


Increase 


Cytokeratin 17 


17q12-q21 


827 


Gain 


3.8-Fold up 


Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10-Fold down 


Decrease 
Increase 


FBP1 


9q22 


827 


Gain 


2.3-Fold up 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibitin 


17q21 


827/733 


Gain 


3.7-/2.5-Fold up b 


Increase 


ProIyl-4-hydroxyl 


17q25 


827/733 


Gain 


5.7-/1 .6-Fold up 


Increase 


hnRNPBI 


7p15 


827 


Loss 


2.5-Fold down 


Decrease 



* In cases where the corresponding alterations were found in both TCCs 827 and 733 these are shown as 827/733. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors (e.g. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q2l) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may tie in the loss of controlled 
methylation in tumor cells (17-19). Thus, it may be possible 
that in chromosomes with increased DNA copy numbers two 
or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case ail 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p-, 9q~, 1q+, Y- 
(2, 6), and in pT1 tumors, 2q-,11p~, 11q-, 1q+, 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and Y-, respectively. Likewise, the two minimal invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1 q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q-J- (both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers. Analysis of these regions 
by positioning heterozygous microsatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level in normal cells and Is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translational 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
transiationally inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
haJf-life (e.g. signaling proteins), A poor correlation between 
mRNA and protein levels was found in liver ceils as deter- 
mined by arrays and 2D-PAGE (25), and a moderate correla- 
tion was recently reported by Ideker ef at. (26) in yeast. 
( Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript) One possible 
explanation could be that by losing one allele the change in 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2-fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translationa! level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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ABSTRACT 

Genetic changes underlie tumor progression and may !ead to cancer- 
specific expression of critical genes. Over 1100 publications have de- 
scribed the use of comparative genomic hybridization (CGH) to analyze 
the pattern of copy number alterations in cancer, but very few of the genes 
affected are known. Here, we performed high-resolution CGH analysis on 
cDNA microarrays in breast cancer and directly compared copy number 
and mRNA expression levels of 13,824 genes to quantitate the impact of 
genomic changes on gene expression. We identified and mapped the 
boundaries of 24 independent amplicons, ranging in size from 0.2 to 12 
Mb. Throughout the genome, both high- and low-level copy number 
changes had a substantial impact on gene expression, with 44% of the 
highly amplified genes showing overexpression and 10.5% of the highly 
overexpressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression levels across 14 
samples were systematically attributable to gene amplification. These 
included most previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, including the HOXB7 gene, 
the presence of which in a novel amplicon at I7q21.3 was validated in 
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose overexpression is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited. 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over 
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Expression ratio 

Fig. 1. Impact of gene copy number on global gene expression levels. A. percentage of 
over- and undercxprcsscd genes (Y axis) according to copy number ratios (X axis). 
Threshold values used for over- and undcrexprcssion were >2.184 (global upper 7% of 
the cDNA ratios) and <0.4826 (global lower 7% of the expression ratios). B t percentage 
of amplified and deleted genes according to expression ratios. Threshold values for 
amplification and deletion were > 1 .5 and <0.7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these amplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome- wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (6) identify and characterize those genes whose mRNA expres- 



5 The abbreviations used are: CGH, comparative genomic hybridization; FISH, fluo- 
rescence in situ hybridization; RT-PCR. reverse transcription-PCR. 
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Fig. 2. Gcnomc-widc copy number and expression analysts in the MCF-7 breast cancer cell line. A, chromosomal CGH analysis of MCF-7. The copy number ratio profile (blue 
line) across the entire genome from lp telomere to Xq telomere is shown along with ± I SD {orange lines). The black horizontal line indicates a ratio of 1 .0: red line, a ratio of 0.8; 
and green line, a ratio of 1.2. B~C t genome-wide copy number analysis in MCF-7 by CGH on cDNA microarray. The copy number ratios were plotted as a function of the position 
of the cDNA clones along the human genome. In B % individual data points are connected wiih a line, and a moving median of 10 adjacent clones is shown. Red horizontal line, the 
copy number ratio of 1.0. In C, individual data points are labeled by color coding according to cDNA expression ratios. The bright red dots indicate the upper 2%, and dark red dots, 
the next 5% of the expression ratios in MCF-7 cells (overexpressed genes); bright green dots indicate the lowest 2%, and dark green dots, the next 5% of the expression ratios 
(underexprcssed genes); the rest of the observations are shown with black crosses. The chromosome numbers are shown at the bottom of the figure, and chromosome boundaries are 
indicated with a dashed line. 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND METHODS 

Breast Cancer Cell Lines. Fourteen breast cancer cell lines (BT-20, BT- 
474, HCC1428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Microarrays. The 
preparation and printing of the 13,824 cDNA clones on glass slides were 
performed as described (1 1-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 yug of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with AltA and RsaA (Life Technol- 
ogies, Inc., Rockville, MD) and purified by phenol/chloroform extraction. Six 
jug of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization ( 1 4, 15) and 
posthybridizarion washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla, CA) was used in all experiments. Forty fig of reference RNA were 
labeled with Cy3-dUTP and 3.5 ^g of test mRNA with Cy5-dUTP ? and the 
labeled cDNAs were hybridized on microarrays as described (13, 15). For both 
microarray analyses, a laser con focal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DE ARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spotted four times onto the 
array. Low quality measurements (i.e., copy number data with mean reference 
intensity -C 100 fluorescent units, and expression data with both test and 
reference intensity <100 fluorescent units and'or with spot size <50 units) 



were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for increased/ 
decreased copy number. Genes with CGH ratio > J .43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5%) were considered to be 
deleted. 

Statistical Analysis of CGH and cDNA Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled 1 for amplification (ratio, >K43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (i). We calculated a weight, w g% for each gene as follows: 

^ m gI - rrigo 

where m sU cr gl and m^, cr^ denote the means and SDs for the expression 
levels for amplified and nonamplified cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and AmpKcon Mapping. Each 
cDNA clone on the microaTray was assigned to a Unigcne cluster using the 
Unigene Build 1 41. 6 A database of genomic sequence alignment information 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



6 Internet address: http^/reseaich.rthgri.nih.gov/microamy/douTiload3bIe_cdra.htmI. 

7 Internet address: www.gcnomc.ucse.edu. 
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Table 1 Summary of independent amplicons in 14 breast cancer cell lines by 
CGH microarray 



Location 


Start (Mtii 


End (Mb) 


Size (Mb) 


lpI3 


132.79 


132.94 


0.2 


lq2! 


173.92 


177.25 


3.3 


Iq22 


179.28 


179.57 


0.3 


3pl4 


71.94 


74.66 


2.7 


7pl2.1-7plL2 


55.62 


60.95 


5.3 


7q3I 


125.73 


130.96 


5.2 


7q32 


140.01 


140.68 


0.7 


8q21.Ii-8q2l.I3 


86.45 


92.46 


6.0 


8q21.3 


98.45 


103.05 


4.6 


8q23.3~Sq24.l4 


129.88 


142.15 


123 


8q24.22 


151.21 


152J6 


1.0 


9p»3 


38.65 


39.25 


0.6 


13q22-<i31 


77.15 


81.38 


42 


16q22 


86.70 


87.62 


0.9 


17qll 


29.30 


30.85 


1.6 


I7ql2-q2l.2 


39.79 


42.80 


3.0 


17q2I32-q2L33 


52.47 


55.80 


3.3 


17q22-q23.3 


63.81 


69.70 


5.9 


|7q23.3-<}24.3 


69.93 


74.99 


5.1 


19ql3 


40.63 


41.40 


0.8 


20qll.22 


34.59 


35.85 


L3 


20ql3.I2 


44.00 


45.62 


1.6 


20ql3.l2-<ql3.13 


46.45 


49.43 


3.0 


20ql3.2-ql3.32 


51.32 


59.12 


7.8 



CGH were validated, with lq21, 17ql2-q2L2, I7q22-q23, 20ql3.1, 
and 20ql3.2 regions being most commonly amplified. Furthermore, 
the boundaries of these amplicons were precisely delineated. In ad- 
dition, novel amplicons were identified at 9pl3 (38.65-39.25 Mb), 
and 17q21.3 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly overcx- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl J— pI2 (Fig. 3A). In BT-474, the two known amplicons 
at 17qi2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. 3B). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB7, were highly amplified in a 
previously undescribed independent amplicon at 1 7q2 1.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 3B> inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC812, and ZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonamplified clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RP11-361K8 was la- 
beled with SpcctrumOrange (Vysis, Downers Grove, IL), and Spectrum- 
Orange -labeled probe for EGFR was obtained from Vysis. SpecirumGreen- 
iabeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(18). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the NIH. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, WI) with 10 ng of mRNA 
as a template. HOXB7 primers were 5 r -GAGC AG AGGGACTCGGACTT-3 ' 
and 5'-GCGTCAGGTAGCGATTGTAG-3'. 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e., belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. J A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. IB). Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resolution mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 
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Fig. 3. Annotation of gene expression data on CGH microarray profiles. .4, genes in the 
7pll-p12 amplicon in the MDA-468 cell line are highly expressed (red dots) and include 
the EGFR oncogene. B. several genes in the I7ql2, 17q2L3, and I7q23 amplicons in the 
BT-474 breast cancer ceil line arc highly overexpressed {red) and include the HOXB7 
gene. The data labels and color coding are as indicated for Fig. 2C. Insets show 
chromosomal CGH profiles for the corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR {red) and chromosome 7 
centromere probe (green) to MDA^68 (A) and //OAB7-spcciflc probe {red) and chro- 
mosome 17 centromere (green) to BT-474 cells (B). 
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Fig. 4. List of 50 genes with a statistically 
significant correlation (a value <0.05) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gene are indicated. The genes have been ordered 
according to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell lines. The 
key to the color code is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 
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amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P - 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Ampi icons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in > 1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarray s to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



* Internet address: htrp;// www. geneontology.org/. 



0 Internet address: httpy/www.ncbi.nlm.nih.gov/entrcz. 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the dysregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 1 0, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the bomeobox gene region at I7q21.3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
turaorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing -2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, MYC> 
EGFR, ribosomal protein s6 kinase, and AJB3, but also numerous 
novel genes such as NRAS-r elated gene (lpI3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q21.3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 
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Summary 

The c-erbB-2 proto-oncogene encodes a transmembrane protein tyrosine kinase receptor of 185 kDa (pi 85) and 
has been associated with several types of human cancers. In human breast cancer, overexpression of pi 85 occurs in 
15-30% of cases, correlates with poor prognostic factors and characterizes breast cancers with a more aggressive 
behavior. Overexpression of pi 85 is usually associated with c-erbB-2 amplification, though it may occur inde- 
pendently and thus define subpopulations of breast cancers which might be of clinical interest, pi 85 expression 
is usually detected by immunohistochemistry (IHC) and few studies have been carried out to evaluate the pi 85 
content of breast cancers with an ELISA technique. In this context, we showed, in 106 breast cancer samples, that 
pl85 was expressed at high levels in 13.2%, intermediate levels in 55.7% and negative ones in 31.1% of cases. 
All pi 85 positive samples showed a c-erbB-2 oncogene amplification while none of the pi 85 negative samples 
and only 4% of pi 85 imtermediate samples had an amplification of c-erbB-2. pi 85 expression is significantly 
correlated with the negativity of estrogen and progestrone receptors, with high levels of cathepsin D and in some 
conditions with axillary nodal involvement. Thus, using the pi 85 ELISA assay, the c-erbB-2 status of breast 
cancers can be defined and moreover a subset can be discriminated which is characterized by intermediate levels 
of pi 85 and absence of c-erbB-2 amplification. The quantitative approach towards pi 85 in breast cancers affords 
the possibility of identifying more appropriately patients with high or low risk and thus permits adaptation of 
therapeutic regimens. 



Introduction 

The c-erbB-2 proto-oncogene encodes a transmem- 
brane protein tyrosine kinase receptor of 185 kDa 
(pi 85) having extensive homology with the epidermal 
growth factor (EGF) receptor, also known as c-erbB- 
1 [1]. Both these oncogenes belong to a family, also 
including c-erbB-3 and c-erbB-4 [2], whose altera- 
tions have been associated with several types of human 
cancers, particularly those involving the breast [3], 
The neu oncogene (which is the rat homologue of c- 
erbB-2) was initially identified in chemically induced 
rat neuroblastoma as the active transforming element 
[4, 5] and oncogenic activation of Neu occurs as the 
result of a single point mutation in the transmem- 



brane domain (Val 664 -> Glu) [6], Overexpression of 
pi 85, as well as specific point mutations in the trans- 
membrane domain, cause an increase of the tyrosine 
kinase activity of Neu and have been shown to have a 
transforming effect in cells [7, 8]. 

Although transmembrane domain point mutations 
are apparently rare in human tumor, overexpression of 
pi 85 has been identified in a wide range of human 
cancers and especially in breast and ovarian tumors 
[9]. In human breast cancer, overexpression of pl85 
occurs in 15-30% of cases and predicts a signifi- 
cantly lower survival rate and a shorter time before 
relapse in patients with lymph node-positive disease 
[9, 10]. Overexpression of pi 85 is usually associated 
with c-erbB-2 amplification, though it may occur in- 
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dependency and thus define subpopulations of breast 
cancers which might be of clinical interest. This fact 
shows that whatever the c-erbB-2 gene status, over- 
expression of pi 85 is the result of a transcriptional 
and/or translational dysregulation and evaluation of 
the pi 85 oncoprotein appears to be more appropri- 
ate for characterizing tumoral subpopulations than 
c-erbB-2 amplification. 

In previous studies, we reported an amplification 
of the c-erbB-2 oncogene in 1 5-20% of human breast 
cancer samples [II, 12] which is in agreement with 
the 10-34% of breast cancers having a c-erbB-2 amp- 
lification [13, 14]. However, the evaluation of pi 85 
expression might constitute a better approach, than 
c-erbB-2 gene amplification, regarding the predictive 
value of this parameter in breast cancer evolution. 
Thus, the aim of our study was to define subsets of 
breast cancers in terms of pi 85 expression and to 
determine the relationship of pi 85 expression with 
the accepted prognostic factors in a breast cancer 
population. 

Materials and methods 

Patients 

Primary breast carcinoma samples were obtained from 
106 women undergoing surgery, from July 1995 to 
December 1995, at the Medical School Hospital (Be- 
sangon, France). For each patient, several clinical and 
pathological parameters were collected afterwards: 
age, tumor size, the Scarf Bloom Richardson histo- 
logical grading (SBR grading), axillary nodal involve- 
ment, hormonal receptors and cathepsin D. 

DNA preparation 

The tumor samples were frozen and kept in liquid ni- 
trogen until processing. The tumor DNA was collected 
from the remaining cellular pellet after separation of 
the cytosol fraction used for the hormonal receptor 
assay. DNA obtained from the leukocytes of healthy 
donors was used as reference DNA. Total genomic 
DNA extraction was carried out using the Genomix 
method (Kontron Instrument, Milan, Italy) according 
to the manufacturer's instructions. 

Slot blot hybridization analysis 

Analysis of c-erbB-2 amplification was performed as 
previously described [11, 12]. For each sample, three 



DNA blots (10, 5 and 2.5 \xg) were performed and 
hybridized with a c-erbB-2 probe corresponding to 
a 1.03kb EcoRl fragment of pGneul (Dr O Brison, 
Institut Gustave Roussy, Villejuif, France). Filters 
were subsequently hybridized with a P-globin probe 
which was a 1.9kb BamHl fragment of pSP64 (Pr M 
Goosens, Inserm U91, Paris, France) for normaliza- 
tion. Both probes were labelled with digoxigenin-1 1- 
dUTP by Nick-Translation according to the recom- 
mendations of the manufacturer (Boehringer Man- 
nheim). After washing, labelled probes were revealed 
by a chemiluminescent reaction (DIG Luminescent 
Detection Kit, Boehringer Mannheim) as described by 
the manufacturer. Then filters were exposed for auto- 
radiography and signals were quantified using a laser 
densitometer Ultroscan XL™ (Pharmacia LKB). The 
degree of c-erbB-2 amplification was evaluated by the 
ratio of the corrected signal for the tumor DNA to the 
corrected signal for the leukocyte DNA. A tumor was 
considered to be amplified when the corrected ratio 
was at least twice that of the leukocyte DNA. 

Jmmunoanalysis 

Human pi 85 oncoprotein was measured by enzyme 
immunoassay using the Human Neu Quantitative 
ELISA Assay™ kit (Oncogene Science) on an aliquot 
of cell lysates before separation of the cytosol fraction 
used for the hormonal receptor assay. The pi 85 pro- 
tein ELISA assay was performed using two antibodies 
(NB3 and TA1) directed against the extracellular do- 
main of the protein. Cuny et al. (1994), using the same 
immunoassay, have shown that the maximum value of 
pi 85 in non-tumoral breast tissue samples was 3 U/u.g 
protein (1 U of pi 85 = 0.05 fmol) and this negative 
cutoff value was retained to define the pi 85 negative 
subgroup. A positive cutoff value does not seem to 
have been definitively established and differs accord- 
ing to the manufacturers. A positive cutoff value of 
20 U of pi 85 per microgram protein, which defines 
the pi 85 positive subgroup, was retained as the most 
informative positive cutoff value with respect to the 
correlation with the clinical and biological parameters. 
Both these cutoff values define a third subgroup of 
intermediate pi 85 (between 3 and 20U/u.g protein). 

Cytosol ic estradiol receptor (ER) and progester- 
one receptor (PgR) determinations were performed by 
enzyme immunoassay ER-EIA and PgR-EIA kits (Ab- 
bott, France). As established by the manufacturer, all 
values above 15fmol/mg protein were considered to 
be receptor positive. 



Cytosolic cathepsin D was measured by immun- 
oradiometric assay ELSA-Cath-D (Cis Biointerna- 
tional, France). A level of cathepsin D greater than 
60pmol/mg protein was considered to be pathologic, 
and a level below 30pmol/mg protein was considered 
to be normal. Both these cutoff values define a third 
subgroup between 30 and 60pmol/mg protein where 
the prognostic value of cathepsin D is more difficult to 
establish [15]. 

Statistical analysis 

A univariate analysis (x 2 -test) was used to compare 
the distribution of the main parameters describing the 
patients and the tumors. 

A Student's test was used to compare the mean 
concentrations. The difference was considered signi- 
ficant when the p value was less than 0.05. 



Results 

The lack of difference between the population studied 
and a population of 550 breast cancers, explored dur- 
ing the same period of time, demonstrates the absence 
of either an intentional or an unintentional selection 
bias and shows that this population can be considered 
as representative of the general population (Table 1). 

pl85 expression correlates with c-erbB-2 
amplification 

Immunoenzymatic assay of pi 85, performed in 106 
breast cancer samples, showed a pi 85 which was pos- 
itive in 14 cases (13.2%), intermediate in 59 cases 
(55.7%) and negative in 33 cases (31.1%). Slot blot 
analysis for c-erbB-2 amplification was performed in 
89 of these 106 samples and a c-erbB-2 amplification 
was detected in 1 4 samples ( 1 5.7%), ranging from two 
to 20 copies. The distribution of c-erbB-2 amplifica- 
tion in terms of pi 85 expression is shown in Table 2 
and there is a positive correlation between the de- 
gree of c-erbB-2 amplification and the level of pi 85 
expression (R 2 = 0.85). The pi 85 intermediate sub- 
group (52.8% of cases) is characterized by absence of 
gene amplification in 47/49 cases (—96%) and there 
is no amplification when pi 85 is negative (<3U/p,g 
protein). Thus, the cutoff values of pi 85 are suitable 
for discriminating breast cancers having a c-erbB-2 
amplification from those without amplification. 
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Table 1. Population comparison between a reference population 
and the population studied 



Parameters Reference pi 85 p 

population population 
(n = 550) (n = 106) 



Age at diagnosis 


<50 


135 


31 


0.31 


(years) 


>50 


415 


75 




Estrogen 


<15 


159 


27 


0.94 


receptors 


>15 


391 


79 




(fmol/mg prot.) 










Progesterone 


<15 


180 


32 


0.63 


receptors 


>I5 


370 


74 




(fmol/mg prot.) 










Cathepsin D 


<30 


125 


18 


0.26 


(pmol/mg prot.) 


30 to 60 


172 


40 






>60 


203 


45 




SBR histological 




73 


7 


0.16 


grade 2 


11 


301 


57 






III 


126 


28 




Axillary hodal 


N 0 


213 


34 


0.36 


involvement 3 


N+ 


157 


32 





I X 2 -test, the difference was considered significant when the p 
value was less than 0.05. 

2 SBR grading according to Scarff, Bloom and Richardson method. 
3 No sub-group encompass patients without axillary nodal involve- 
ment while N+ sub-group those with axillary nodal involvement. 



Table 2. Distribution of c-erbB-2 oncogene amplification and 
pi 85 oncoprotein expression in a population of breast carcino- 
mas 







pi 85 expression 






Positive 


Intermediate 


Negative 


c-erbB-2 


+ 12 


2 


0 


amplification 


0 


47 


28 



Relationship between p]85 expression and clinical 
and biological parameters 

Table 3 summarizes the correlation results and shows 
that pi 85 expression correlates with the estrogen 
receptors (p = 0.023), the progesterone receptors 
(p = 0.036) and the cathepsin D (p- 0.027). In or- 
der to better define the correlation of pi 85 expression 
with the steroid receptors, we determined the mean 
concentrations of both steroid receptors for each pi 85 
subgroup (Table 4). The results show that the pi 85 
intermediate subgroup expresses significantly higher 
levels of both steroid receptors as compared to the 
pi 85 positive subgroup. Surprisingly, the pi 85 nega- 
tive subgroup also expresses low levels of estrogen 
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Table 3. Relationship between pi 85 expression and clinical and biological parameters 
in a population of breast carcinomas 



Parameters pi 85 pi 85 pi 85 p 

positive intermediate negative 
(n = 14) (n = 59) (n = 33) 



Age at diagnosis 


<50 


7 


16 


8 


0.179 


(years) 


>50 


7 


43 


25 




Estrogen 


<15 


6 


9 


12 


0.023 


receptors 


>15 


8 


50 


21 




(fmol/mg prot.) 












Progesterone 


<15 


7 


12 


13 


0.036 


receptors 


>15 


7 


47 


20 




Cathepsin D 


<30 


0 


7 


11 


0.027 


(pmol/mg prot.) 


30 to 60 


4 


26 


10 






>60 


8 


25 


12 




Histological 


Ductal 


14 


56 


27 


0.152 


type 


lobular 


0 


3 


6 




SBR histological 


I 


2 


4 


1 


0.494 


grade 2 


11 


6 


38 


13 






III 


6 


12 


10 




Axillary nodal 


No 


3 


16 


15 


0.289 4 


involvement 3 


N+ 


5 


20 


7 




Size (cm) 


<1 


2 


6 


6 


0.620 




>1 


12 


52 


24 





1 X 2 -test, the difference was considered significant when the p value was less than 
0.05. 

2 SBR grading according to Scarff, Bloom and Richardson method. 
3 Nq sub-group encompass patients without axillary nodal involvement while N-f sub- 
group those with axillary nodal involvement. 

4 when pi 85 positive and intermediate are pooled and compared with pi 85 negative 
then p = 0.055. 



Table 4. Relationship between pi 85 expression and steroid receptors in a population of breast 
carcinomas 



Parameters 




n 


Estrogen 
receptors 
(fmol/mg prot.) 


P 1 


Progesterone 
receptors 
(fmol/mg prot.) 




pi 85 expression 


<3 


33 


72 ±27 


0.000007 2 


224 ±128 


0.073 2 


(UAig prot.) 


3-20 


59 


249 ±65 




403 ±141 






>20 


14 


61 ±68 


0.00047 3 


127 ±103 


0.0036 3 



1 Student's test, the difference was considered significant when the p value was less than 0.05. 
2 pl85 <3 vs. pi 85 3-20. 

3 pi 85 3-20 vs. pi 85 >20, (results are expressed as the mean ± sem). 



receptors which are not significantly different from 
those observed in the pi 85 positive subgroup. 

For the other parameters, there is no correlation 
with pi 85 expression when the two cutoff values sys- 
tem is applied. On the other hand, a positive cutoff 
value of 3 U of pi 85 per microgram protein improves 
the correlation with the axillary nodal involvement 



(p = 0.055). In this case statistical significance was 
reached when a positive cutoff value of 4 U of pi 85 
per microgram protein (p = 0.025) was applied and 
confirms the correlation of pi 85 overexpression with 
the axillary nodal involvement. However, there is a 
decrease in the statistical significance of correlations 
between pi 85 expression and the other clinical and 
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biological parameters when a positive cutoff value of 
4U of pi 85 per microgram protein is used. 

Thus, according to the positive cutoff value of 
pi 85, the correlation between the pi 85 expression 
and the clinical and biological parameters fluctuates. 
These fluctuations result from the ambiguous behavior 
of the pi 85 intermediate subgroup which is some- 
times associated with good prognostic factors (estro- 
gen and progesterone receptors) like the pi 85 negat- 
ive subgroup, and is sometimes associated with poor 
prognostic factors (cathepsin D and axillary nodal 
involvement) like the pi 85 positive subgroup. 

In our breast cancer population, no correlation 
of pi 85 expression with the histological type or 
grade was observed. In addition, breast carcinomas 
of lobular type never overexpressed pi 85, and only 
3/9 (33.3%) expressed pi 85 at intermediate levels 
(Table 3). 



Discussion 

c-erbB-2 proto-oncogene amplification occurs in 10- 
35% of breast cancers and constitutes a poor pro- 
gnostic factor [9, 10]. The main expression product of 
this gene is a 185kDa transmembrane protein whose 
level of expression depends on (i) the degree of amp- 
lification of the gene and/or, (ii) an increase of the 
transcription rate of the gene. Therefore, a substantial 
proportion of breast cancers express high or moderate 
levels of pi 85 without c-erbB-2 oncogene amplifica- 
tion. Characterization of this subset of breast cancers 
is of clinical interest and might thus be helpful for 
clinicians in the treatment decision process. 

In breast cancers, numerous studies have been per- 
formed to investigate the c-erbB-2 amplification by 
Southern blot, slot blot or FISH (Fluorescence in situ 
hybridization) as well as pi 85 expression by immun- 
ohistochemistry (IHC) [13, 14] in order to evaluate 
the prognostic value as well as the potential applic- 
ations of c-erbB-2 testing for predicting response of 
breast cancer to therapy. Despite the disparity of 
the results, it appeared that (i) the c-erbB-2 altera- 
tions (mostly amplification) were correlated with a 
shortened time before disease relapse and a dimin- 
ished overall survival and, (ii) that the most common 
association of pi 85 overexpression with therapeutic 
response was the apparent resistance of these tumors 
to hormone therapy alone. In summary, breast cancers 
overexpressing pi 85 correlate with poor prognostic 



factors and have a more aggressive behavior than those 
without overexpression. 

Immunoenzymatic assay of pi 85 and evaluation of 
the degree of c-erbB-2 amplification in breast cancers 
have rarely been performed simultaneously. When 
the quantitative method evaluating pi 85 was used to 
characterize c-erbB-2 status of breast cancer samples, 
authors have realized a dichotomous distribution ac- 
cording to various cutoff values [16-19]. However, 
a dichotomous distribution leads to a great loss of 
information [20]. In this context, like Cuny et al. 
[21], we realized a subdivision into three subgroups 
(positive, intermediate and negative) according to two 
cutoff values (20 and 3 U/jig prot.). Such a categoriz- 
ation cannot be obtained when c-erbB-2 amplification 
is used and therefore with pi 85 quantification it was 
possible to define a subset of breast cancers with mod- 
erate levels of pi 85. In our study, this subgroup of 
breast cancers (52.8% of cases) showed an ambigu- 
ous behavior as it was simultaneously associated with 
favourable prognostic factors (estrogen and proges- 
terone receptors) and unfavourable ones (cathepsin D 
and axillary nodal involvement). Dittadi et al. [22] 
showed in 115 breast cancer specimens, that pi 85 
evaluation by ELISA allowed definition of an in- 
termediate concentration group with a significantly 
better disease-free survival than the low and high con- 
centration groups. Therefore, it appears that a pi 85 
intermediate expression in breast cancers is preferably 
associated with favourable outcome. 

In the breast cancer population studied, we showed 
that the mean ER concentrations were different ac- 
cording to the pi 85 subgroup concerned. It has been 
observed that 17-0 estradiol downregulates c-erbB- 
2 oncogene expression [23, 24]. Thus, the high ER 
levels (249 ± 65 fmol/mg prot.) observed in the pi 85 
intermediate subgroup might explain the moderate ex- 
pression of pi 85. Conversely, the relatively low ER 
levels of pi 85 positive subgroup could be insufficient 
to repress c-erbB-2 expression. On the other hand, 
the pi 85 negative subgroup presents relatively low ER 
levels and this fact is not in agreement with the former 
hypothesis. This observation suggests the intervention 
of other factors in the regulation of c-erbB-2 expres- 
sion. In addition, this subset of breast cancers might 
have negative prognostic significance as suggested by 
Koscielny et al. [25]. The antiestrogen tamoxifen up- 
regulates c-erbB-2 oncogene in estrogen-responsive 
breast cancers [26]. Thus, the absence of clinical re- 
sponse to antiestrogen therapy in 40% of patients with 
ER-positive disease might be due to the upregulation 



168 LBermontetaL 



c-erbB-2 oncogene in the primary tumor. Conversely, 
some ER-positive tumors are characterized by the ab- 
sence of pl85 expression and this suggests that tamox- 
ifen and estradiol could control c-erbB-2 expression 
only when this gene is switched on. The mechanism 
responsible for this switching on is not well known 
but it might be independent of the presence of estro- 
gen receptors or dependent on the presence of other 
activated factors. 

As it was observed by Cuny et al. [21], breast car- 
cinomas of the lobular type rarely overexpress pi 85 
and if so, only at intermediate levels, and this confirms 
the specific expression of pi 85 by the ductal type. 
Moreover, the absence of correlation of p 1 85 expres- 
sion with the histologic grade suggests an early im- 
plication of c-erbB-2 alterations in the tumoral growth 
although some studies reported a correlation of p 1 85 
overexpression with high grade breast cancers [16, 
27, 28]. The reasons for these discrepancies may be 
related to the mode of selection of the sample popula- 
tion, technical problems or the criteria used to define 
overexpression. 

The paradoxical correlations of the pi 85 interme- 
diate subgroup with the clinical and biological para- 
meters suggests the existence of underlying factors 
which might discriminate the breast cancers hav- 
ing a truly pejorative prognosis from those hav- 
ing a more favourable issue. Christianson et al. 
[29] have identified, in human breast carcinomas, a 
NH2-terminal-truncated HER-2/neu protein of 95 kDa 
(namely p95HER-2/neu) which was expressed in 
22.4% of breast cancers and was preferentially found 
in pl85HER-2/neu-positive patients with lymph node 
involvement. A biological explanation of their findings 
was that loss of the extracellular domain of pi 85 from 
the p95 kinase combined with an amplified pi 85 sig- 
nal in primary breast tumor cells could promote their 
metastasis to the lymph nodes. 

There is at present no specific ligand known to 
bind pi 85 and pi 85 homodimerizes or heterodimer- 
izes with the other members of the c-erbB family [30]. 
Siegel et al. [31] have shown, in transgenic mice, that 
coexpression of c-erbB-2 and c-erbB-3 induced mul- 
tiple mammary tumors that frequently metastasized to 
the lung. Moreover, they detected in human breast can- 
cers an alternatively spliced form of c-erbB-2 which 
encodes an ErbB-2 receptor harboring an in-frame de- 
letion of 16 amino acids in the extracellular domain 
and can transform Rat-1 fibroblasts. Their observa- 
tions infer that coexpression of ErbB-2 and ErbB-3 
may play a critical role in the induction of human 



breast cancers, and evoke the possibility that activating 
mutations in the ErbB-2 receptor may also contribute 
to this process. 

Here, we showed that quantification of pl85HER- 
2/neu allows characterization of subsets of breast can- 
cers which are, or are not, correlated with c-erbB-2 
amplification but which support divergent correlations 
with the classical prognostic factors. It appears that, 
considering the multiple alterations affecting the c- 
erbB-2 oncogene (amplification, mutation, alternative 
splicing, overexpression) single determination of pi 85 
expression by an ELISA method is well adapted to 
characterization of the c-erbB-2 status of breast can- 
cers and could improve clinical information. However, 
further investigations of the c-erbB family members 
could be useful for better characterization of breast 
cancers and particularly those harboring an overex- 
pression of pi 85 without c-erbB-2 gene amplification. 
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ABSTRACT 

DNA copy number gains and amplifications at 17q are frequent in 
gastric cancer, yet systematic analyses of the 17q amplicon have not been 
performed. In this study, we carried out a comprehensive analysis of copy 
number and expression levels of 636 chromosome 17-specific genes in 
gastric cancer by using a custom-made chromosome 17-specific cDNA 
microarray. Analysis of DNA copy number changes by comparative 
genomic hybridization on cDNA microarray revealed increased copy 
numbers of 11 known genes (ERBB2, TOP2A, GRB7, ACLY, PIPSK2B, 
MPRIA5, MKP-U LHX1, MLN51, MLN64, and RPL27) and seven ex- 
pressed sequence tags (ESTs) that mapped to 17ql2-q21 region. To inves- 
tigate the genes transcribed at the 17q, we performed gene expression 
analyses on an identical cDNA microarray. Our expression analysis 
showed overexpression of 8 genes {ERBB2, TOP2A, GRB2, AOC3, AP2B1, 
KRT14, JUP, and ITGA3) and two ESTs. Of the commonly amplified 
transcripts, an uncharacterized EST AA552509 and the TOP2A gene were 
most frequently overexpressed in 82% of the samples. Additional studies 
will be initiated to understand the possible biological and clinical signifi- 
cance of these genes in gastric cancer development and progression. 



INTRODUCTION 

Gastric carcinoma is one of the most common malignancies world- 
wide and is the second most frequent cause of cancer- related death 
(1). Moreover, cardia, gastroesophageal junction, and esophageal ad- 
enocarcinomas have the most rapidly rising incidence of all visceral 
malignancies in the United States and Western world for reasons that 
are unclear (2). Previous studies have documented the importance of 
genetic alterations affecting known oncogenes, tumor suppressor 
genes, and mismatch repair genes in the development of gastric cancer 
(3, 4). Several genes, such as cMET, ERBB2, MYC, and MDM2, are 
amplified in 10-25% of tumors, and their amplification is associated 
with advanced disease (3, 5). Comprehensive DNA copy number 
analyses of gastric cancers using CGH 4 have demonstrated recurrent 
DNA copy number changes on several chromosomal regions. Gains at 
1 7q have been shown to be frequent in gastric cancers (6). However, 
the critical regions of genetic alterations are large, and the target genes 
for amplification at 17q remain unknown. 

Characterization of the chromosomal regions involved in DNA 
copy number changes is likely to reveal genes important for the 
development of gastric cancer. In the present study, we used a custom- 
made chromosome 17-specific cDNA microarray to systematically 
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evaluate copy numbers and expression levels of genes at 1 7q in gastric 
carcinomas. 

MATERIALS AND METHODS 

Samples. Sixteen gastric cancer xenografts, four gastric cancer cell lines 
(CRL-5822, CRL-5974, CRL-5973, and CRL-1739) from the American Type 
Culture Collection (Manassas, VA), and five primary gastric cancers were used 
in this study. The cell line (CRL-1739) with normal DNA copy number of 
chromosome 17 was included as a control in Northern blot hybridizations. The 
cell lines were cultured under recommended conditions. Xenografting of 
gastric cancers was performed as described earlier (7). All tumors included in 
this study were dissected and verified histologically to be composed predom- 
inantly of neoplastic tissues. We have earlier characterized the DNA copy 
numbers of the cell lines and xenografts using "chromosomal" CGH. The 
details of the DNA copy numbers of the xenografts have been reported 
elsewhere (7). Fig. \A summarizes the chromosomal CGH results for chro- 
mosome 17. 

Chromosome 17-specific cDNA Microarray. The construction of the 
chromosome 17-specific cDNA microarray has been described previously (8). 
Briefly, the cDNA microarray contained a total of 636 clones, including 88 
house keeping genes, 201 known genes from chromosome 17, and 435 EST 
clones from radiation hybrid map intervals D17S933-DI7S930 (293-325 cR, 
the 17ql2-q2l region) and D17S791-D17S795 (333-435 cR, the 17q23-q24 
region). The preparation and printing of the cDNA clones on glass slides were 
performed as described elsewhere (9). 

Copy Number and Expression Analyses by cDNA Microarrays. 
Genomic DNA was extracted from eight xenografts (XI 1, X27, X57, X71, 
X75, X79, X83, and X95) and three cell lines (CRL-5822, CRL-5973, and 
CRL-5974). All cases had gains or high-level amplification at 17q by chro- 
mosomal CGH (Fig. 1). Normal genomic DNA was used as a reference in all 
experiments. Copy number analysis using CGH microarray was performed as 
described previously (8, 10). Briefly, 20 /ig of genomic DNA were digested for 
14-18 h with Alul and Rsal restriction enzymes (Life Technologies, Inc., 
Rockville, MD) and purified by phenol/chloroform extraction. Digested gastric 
cancer test DNA (6 ^tg) was labeled with Cy3-dUTP (Amersham Pharmacia 
Biotech, Piscataway, NJ) and 6 /ig of reference DNA with Cy5-dUTP using 
Bioprime Labeling kit (Life Technologies, Inc.). Hybridization was done 
according to the protocol by Pollack et al. (10) and posthybridization washes 
as described previously (11). 

Total RNA was extracted from eight xenografts (X43, X49, X57, X68, X75, 
X76, X80, and X95) and three gastric cancer cell lines (CRL-5822, CRL-5973, 
and CRL-5974) by using RNeasy kit (Qiagen, GmbH, Hilden, Germany). A 
pool of four normal gastric epithelial tissue samples, enriched for the epithelial 
layer of the stomach through dissection and mucosal scrapping, was used as a 
standard reference in all experiments. Reference RNA (100 jig) was labeled 
with Cy5-dUTP and 80 jig of test RNA with Cy3-dUTP by use of oligode- 
oxythymidy late-primed polymerization by Superscript II reverse transcriptase 
(Life Technologies, Inc.). The labeled cDNAs were hybridized on microarrays 
as described previously (11, 12). 

For both the copy number and expression analyses, the fluorescence inten- 
sities at the cDNA targets were measured by using a laser confocal scanner 
(Agilent Technologies, Palo Alto, CA). The fluorescent images from the test 
and control hybridizations were scanned separately, and the data were analyzed 
using the DEARRAY software (13). After the subtraction of background 
intensities, the average intensities of each spot in the test hybridization were 
divided by the average intensity of the corresponding spot in the control 
hybridization. On the basis of our earlier reports (8, 14), clones that showed a 
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Fig. I. DNA copy numbers in gastric cancer. A, summary of gains and high-level 
amplifications affecting chromosome 17 in gastric cancer xenografts and cell lines by 
chromosomal CGH. Horizontal bars, the extent of the copy number aberration in each 
sample. High-level amplifications are presented as wide bars, xenograft samples X43 T 
X80, and cell line CRL-1739 had no detectable gains on chromosome 17. B, copy numbers 
survey of chromosome 17-specific genes in X83 gastric cancer xenograft by CGH 
microarray. The copy number ratios were plotted as a function of the position of the clones 
in the radiation hybrid map in cR scale. Individual data points were connected with a line. 
The chromosome 17 ideogram is shown below for visual comparison only. 



copy number ratio s 1 .5 were considered as amplified, and clones that showed 
an expression ratio ^ 3 were considered as overexpressed. Clones that showed 
such increased ratios in the self versus self control experiment were excluded 
from the analysis. 

Northern Hybridization. Total RNA was extracted from four gastric 
cancer cell lines and two normal stomach specimens using the RNeasy kit 
(Qiagen, GmbH). The Northern hybridization was performed using standard 
methods. Briefly, 10 /utg of total RNA were size-fractionated on a 1% agarose 
gel containing formaldehyde and transferred on a Nytran membrane 
(Schleicher & Schuel, Keene, NH). The membrane was prehybridized for 1 h 
at 65°C in Express hybridization solution (Clontech, Palo Alto, CA) together 
with sheared Herring sperm DNA (10 /xg/ml; Research Genetics, Huntsville, 
AL). Sequence-verified cDNA inserts were labeled with P 32 by random prim- 
ing (Prime-It; Stratagene, La Jolla, Ca). Hybridization was performed in the 
Express hybridization solution (Clontech) at 65 °C overnight followed by 
washes in 2 X SCC/SDS solutions. Signals were detected by autoradiography. 
The normal gastric tissues and CRL-1739 cell line (normal chromosome 17 on 
CGH) were used as control samples. A GAPDH cDNA was used as a control 
probe. 

Multiplex RT-PCR. Multiplex RT-PCR was used to validate the cDNA 
array results for the two most overex pressed genes (ESTAA552509 and 
TOP2A) using seven xenografted and six primary gastric cancer samples. For 
reference expression, a pool of normal gastric epithelial tissues obtained from 
different individuals was used. Primary tumors of four xenografts were in- 
cluded in the analyses. mRNA was purified from the tissues using mRNeasy 
(Qiagen), and cDNA synthesis was performed using Advantage RT-for-PCR 
Kit (Clontech). In each PCR reaction, primers for the human GAPDH gene 
were used as an internal reference. The PCR reactions were done using 
standard protocol for 28 cycles. We confirmed the reproducibility of the 
method by repeating the RT-PCR twice, and the results were consistent. The 
primers used for the RT-PCR were obtained from GeneLink (Hawthorne, NY), 
and their sequences are available on request. 

RESULTS 

Detailed Characterization of the 17q Amplification Using Chro- 
mosome-specific Microarray. Copy number levels of 636 chromo- 
some 17-specific genes were evaluated by CGH microarray in eight 
xenografts (XI 1, X27, X57, X71, X75, X79, X83, and X95) and three 
gastric cancer cell lines (CRL-5822, CRL-5973, and CRL-5974) that 



Table 1 Summary of copy number ratios of 18 chromosome 17qJ2-q2J transcripts in eight xenografts and three cell lines of gastric cancer by CGH microarray" 
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2.5 


3 


4.3 


2.7 


Hs. 


1 1 161 I 


AA 190881 


45301136 


17q21.2 


4.6 


2.5 


2.2 


6.1 


1.7 


0.9 


2 


2 


2.1 


1.4 


2.9 


Hs. 


171872 


A1540663 


46054957 


17q21.3 


1.4 


1.7 


0.5 


1.4 


0.9 


2.9 


1.6 


1.2 


1.9 


3.7 


1.1 



MRPL45 Mitochondrial ribosomal 

protein L45 
MKP-1 like protein tyrosine 

phosphatase (MKP-L) 
LIM homeobox protein 1 (LHX1) 
Phosphatidylinositol-4-phosphate 5- 

kinase, type II , 0 (PIP5K2B) 
EST 

EST (FU20940 hypothetical protein) 
H. sapiens MLN64 mRNA 
V-erb-b2 avian erythroblastic leukemia 
viral oncogene homolog 2 (ERBB2) 
EST 
EST 

Growth factor receptor-bound protein 

7 (GRB7) 
H. sapiens MLN51 mRNA 
Topoisomerase (DNA) II a (170kD) 

(TOP2A) 
EST 

ATP citrate lyase (ACLY) 
EST 

Ribosomal protein L27 (RPL27) 
EST (DEAD/H (Asp-Glu-Ala-Asp/His) 
box polypeptide 8 ? DDX8) 



" Copy number ratios above the 1 .5 threshold are shown in bold. 

h Alignment (bp position) and locus are shown according to Santa Cruz August freeze 2001 assembly. 
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Table 2 Summary of expression levels of JO chromosome I7ql2-q21 transcripts in eight xenografts and three cell lines of gastric cancer by cDNA microarray" 

Samples 



Gene 



Unigene Id 


Accession 


Alignment* 


Locus* 


X43 


X49 


X57 


X68 


X75 


X76 


X80 


X95 


CRL- 

5822 


CRL- 
5973 


PR! 

5974 


MS. /W/O 




j 1 jL / tw 


17nl? 


j ] 


i 
i 




■j i 
j.i 


0.9 


II 


0.9 


0.8 


4.8 


7.4 


4.6 


Hs. 286192 


AA552509 


41868584 


17q21.1 


21.9 


4.5 


6.4 


7.6 


10.2 


10 


17.3 


0.6 


12.1 


11.2 


0.6 


Hs. 323910 


AA446928 


41940229 


17q21.1 


1 


1 


3 


3.7 


1.4 


0.9 


1.3 


0.7 


24.6 


0.7 


1 


Hs. 156346 


AA026682 


42521254 


17q21.2 


4.1 


6.1 


16 


4.5 


2.8 


6.6 


3 


1.4 


5.6 


7.6 


6.8 


Hs. 117729 


H44127 


43757143 


17q21.2 


3.9 


1.4 


1.1 


1.6 


3.5 


1 


1.2 


0.6 


3.8 


1.3 


0.8 


Hs. 2340 


R06417 


43994962 


17q21.2 


3.1 


2.8 


0.9 


1.2 


43 


0.9 


2.6 


3.4 


5 


1.9 


2.5 


Hs. 198241 


T77398 


45078066 


17q21.2 


4.6 


1.9 


4.5 


2.6 


2.1 


2.6 


4.2 


1.6 


3.1 


1.8 


5 


Hs. 265829 


AA424695 


54688140 


I7q21.3 


4.8 


1.5 


0.9 


4.5 


3.4 


1.3 


1.1 


1.2 


2.7 


2.2 


0.5 


Hs. 56105 


AA284262 


65817334 


I7q23.2 


1 


1.7 


S3. 


3 


15.6 


2.8 


0.6 


1.9 


1.3 


0.6 


0.4 


Hs. 296381 


AA449831 


81840742 


17q25.I 


0.8 


0.8 


2.2 


1.4 


1.3 


1.8 


I.I 


1.3 


3.1 


5.6 


3.5 



Adaptor-related protein 

complex 2, /31 subunit 

(AP2B1) 
EST (Hypothetical protein 

FU20940) 
V-erb-b2 avian erythroblastic 

leukemia viral oncogene 

homolog 2 (ERBB2) 
Topoisomerase (DNA) II a 

(170kD) (TOP2A) 
Keratin 14 (KRT14) 
Junction plakoglobin (JUP) 
Amine oxidase, copper 

containing 3 (AOC3) 
Integrin, a-3 (ITGA3) 
EST 

Growth factor receptor- 
bound protein 2 (GRB2) 



" Expression ratios above the 3 threshold are shown in bold. 

* Alignment (bp position) and locus are shown according to Santa Cruz August freeze 2001 assembly. 



showed gain or high-level amplification affecting chromosome 17 by 
chromosomal CGH (Fig. 1). CGH microarray analysis revealed in- 
creased DNA copy numbers (ratio ^ 1 .5) in three or more cases for 
1 1 genes and seven ESTs that map to 17ql2 (4 clones) and 17q21 (14 
clones; Table 1). The amplified genes/ESTs were localized at 302- 
321 cR in the radiation hybrid map 5 (Fig. \B) and between 
38274220-46054957 bp at 17q, according to the University of Cal- 
ifornia Santa Cruz's August freeze 2001 assembly of the human 
genome sequence. 6 The two most consistently amplified clones were 
EST (H62271) and ribosomal protein L27 (82%). Other frequently 
amplified genes included TOP2A, EST AA552509, and ERBB2. The 
details of the copy numbers and location of these genes/ESTs are 
listed in Table 1. 

Gene Expression Profiling of 17q Using cDNA Microarrays. 
Parallel expression survey in eight xenografts (X43, X49, X57, X68, 
X75, X76, X80, and X95) and the three cell lines identified 10 
transcripts at 17q whose expression was elevated (ratio > 3) in at least 
three specimens, as compared with the normal gastric epithelial cells 
(Table 2; Fig. 2). Three of the commonly amplified sequences 
(TOP2A, ERBB2, and EST AA552509) that map to 17q21 were also 
overexpressed frequently in our cDNA expression analyses. The two 
most consistently affected transcripts were EST AA552509 (82%) and 
the TOP2A (82%). 

Other frequently overexpressed genes included AOC3 (45%), JUP 
(36%), ERBB2 (27%), ITAG3 (27%), and KRT14 (27%) at 17q21 
region, as well as AP2B1 at 17ql2, EST AA284262 at I7q23, and 
GRB2 at 17q25 (Table 2; Fig. 2). 

Northern Blotting. Northern analysis was used as an independent 
expression assay to validate the cDNA microarray results. Because of 
the limited availability of RNA from the xenografted tumors, only cell 
lines were analyzed. Three genes, EST AA552509, TOP2A, and 
ERBB2, that showed overexpression in one or more cell lines by 
cDNA microarray were selected for analysis. Results from the North- 
ern analysis confirmed the cDNA microarray data. ERBB2 was highly 
overexpressed in CRL-5822 cell line, TOP2A in all three cell lines, 
and EST AA552509 in CRL-5822 and CRL-5973 (Fig. 3). These 
genes were not expressed in the normal gastric epithelial sample or the 
gastric cell line (CRL-1739) that had normal chromosome 17 DNA 
copy numbers by chromosomal CGH (Fig. 3). 



Multiplex RT-PCR. Expression analyses with RT-PCR showed 
elevated expression of TOP2A and EST AA552509 in all tested tumor 
samples, whereas no expression was seen in the pool of normal gastric 
epithelial tissues (Fig. 3). The xenografts and their corresponding 
primaries showed similar levels of expression. 

DISCUSSION 

Studies by chromosomal CGH have indicated that 17q is amplified 
frequently in gastric cancer. Here we used a custom-made cDNA 



cm cn 
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■0.3 




AP2B1 

EST AA552509 
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AOC3 
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ESTAA284262 
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<3 3 5 >7 
Ratio 



5 Internet address: http://www.ncbi.nlm.nih.gov/genemap. 

6 Internet address: http://genome.ucsc.edu. 



Fig. 2. Expression patterns of the most commonly overexpressed genes in gastric 
cancer xenografts and cell lines. Names of the genes are indicated on the right. Color 
coding for the expression ratios is shown below the graph. This image was created using 
Tree view software written by Michael Eisen, copyright 1998-1999, Stanford University. 
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Fig. 3. Validation of overexpressed genes in 
gastric cancer. A, Northern analysis of TOP2A, 
ERBB2, and EST AA552509 expression in normal 
gastric tissue and four gastric cancer cell lines. 
CRL-1739 had normal copy numbers by CGH. The 
size of each transcript is indicated on the right side 
of the corresponding picture. GAPDH was used as 
a loading control. B, expression analysis by multi- 
plex RT-PCR in normal gastric tissue, seven xe- 
nografts (indicated by X- number), and six primary 
gastric cancers (indicated by G-number). Xe- 
nografts and their corresponding primary cancers 
have the same number. The names of the gene are 
shown on the right. 
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microarray that contained 636 cDNA clones from chromosome 17 to 
systematically analyze the copy number changes at 17q in eight 
gastric cancer xenografts and three cell lines. The CGH microarray 
analyses showed increased copy number ratios for 18 clones that were 
localized to the 17ql2-q21 region. To identify those genes that are 
activated through increased copy number, we performed a compre- 
hensive gene expression profiling using the same chromosome 17- 
specific cDNA microarray. Three of the commonly amplified tran- 
scripts (TOP2A, ERBB2, and EST AA552509) that map to 17q21 
were overexpressed frequently in our analyses and might, therefore, 
represent putative amplification target genes in gastric cancer. The 
cDNA microarray results were validated using Northern and RT-PCR 
analyses. 

The two most frequently overexpressed genes in our samples were the 
EST AA552509 and TOP2A. In addition, ERBB2 was also amplified and 
overexpressed in >30% of tumors. Our data show that these genes are 
overexpressed in gastric cancers with no indication of their expression in 
normal gastric epithelial tissues. The overexpression of EST AA552509 
has not been reported before and might be important for gastric carcino- 
genesis or have a possible value as a tumor marker or therapeutic target. 
On the other hand, the importance of TOP2A, and ERBB2 in cancer, 
especially breast cancer, is well known (15, 16). TOP2A is an enzyme 
that catalyzes ATP-dependent strand-passing reactions and functions in 
DNA replication and chromosome condensation and segregation (17). 
TOP2A is a molecular target for many anticancer drugs (topo2 inhibitors). 
ERBB2 is amplified frequently in breast cancer and has been shown to be 
an independent prognostic factor (18, 19). In breast cancer, TOP2A is 
often coamplified with ERBB2 (20, 21). In our gastric adenocarcinomas, 
amplification and overexpression of TOP2A were independent of and 
also more frequent than ERBB2. Previous studies of ERBB2 in gastric 
cancer have shown that the frequency of its overexpression varies from 9 
to 38% (22, 23), which is in agreement with our findings. Our results 
provide additional evidence that clinical studies are required to determine 
the possibility that TOP2A and ERBB2 are useful targets for cancer 
therapy in gastric cancer patients with these molecular alterations. 

The up-regulation of GRB2, JUP, and ITAG3 genes in the present 
study supports our earlier results that show these genes to be overex- 
pressed in gastric cancer (7). Interestingly, studies in breast cancer 
suggest that GRB2 may mediate transmission of ERBB2 oncogenic 
signals, which in turn activate mitogen-activated protein kinase path- 
way (24, 25). GRB2 is a widely expressed protein, which plays a 
crucial role in activation of several other growth factors (26). 

KRT14, AOC3, and AP2B1 were overexpressed in >3 of 1 1 of our 
gastric cancers. Copper-containing amino oxidases, such asAOC3, are 
involved in the catabolism of putrescine and histamine and are also 
involved in the regulation of growth and apoptosis (27). The AP2B1 
is a member of AP complexes that function as vesicle coat compo- 
nents in different membrane traffic pathways. AP-2 complex associ- 
ates with the plasma membrane and directs the internalization of 



trafficking cell surface protein (28). However, there is no information 
about the possible role of these genes in cancer. 

Our study has identified genes that are coamplified at 17ql2 and 
17q21 amplicons that are not altered transcriptionally in comparison 
of tumors to normal reference samples. The lack of correlation be- 
tween some amplified genes and their expression profile suggests that 
these genes are not critical targets at the 1 7q amplicon but might be 
coamplified together with critical genes within the amplicon structure. 
We also found genes that were overexpressed but not amplified by 
CGH microarrays. These results in CGH microarray may be attributed 
to the resolution of CGH-based technologies. On the other hand, 
upstream gene regulation and/or mutations are known as important 
biological mechanisms in transcriptional regulation irrespective of 
gene copy number. 

Comparison of this gastric cancer study with our earlier data from 
breast cancer using the same cDNA microarray revealed a different 
pattern of alterations affecting chromosome 17 (8, 14). In breast 
cancer, two common regions of increased copy number and expres- 
sion, 17ql2-q21 and 17q23, were observed. In addition, the genes 
influenced by the 17ql2-q21 amplification in gastric cancer differed 
from those in breast cancer where ERBB2 was among the most 
strongly affected (8, 14). These results indicate that although 17q is 
involved frequently in copy number alterations in several cancers, the 
target loci and genes might be different from one tumor type to 
another. 

In summary, the present study demonstrates that although the 17q 
region contains hundreds of genes, only three genes were frequently 
amplified and overexpressed in gastric cancers, as compared with 
normal gastric epithelial tissues. The consistent overexpression of 
TOP2A in our gastric cancers suggests that this gene may be a 
potential target for topo2 inhibitors in gastric cancer patients. The 
overexpression of EST AA552509, in the majority of our samples, 
suggests that this novel gene may play a critical role in gastric 
tumorigenesis. We have initiated additional studies to explore the 
possible biological and clinical significance of these genes in gastric 
cancer development and progression. 
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Abstract 

Purpose: This study aims to identify differentially ex- 
pressed genes in esophageal squamous cell carcinoma 
(ESCC) through the use of a membrane-based cDNA array. 

Experimental Design: Two newly established human 
ESCC cell lines (HKESC-1 and HKESC-2) and one corre- 
sponding to a morphologically normal, esophageal epithe- 
lium tissue specimen, prospectively collected from the 
HKESC-2-related patient, were screened in parallel using a 
cDNA expression array containing gene-specific fragments 
for 588 human genes spotted onto nylon membranes. 

Results: The results of cDNA expression array showed 
that 53 genes were up-regulated 2-fold or higher and 8 genes 
were down-regulated 2-fold or higher in both ESCC cell 
lines at the mRNA level. Semiquantitative RT-PCR analysis 
of a subset of these differentially expressed genes gave re- 
sults consistent with cDNA array findings. Four of the dif- 
ferentially expressed genes that belong to the categories of 
oncogenes/tumor suppressor genes (Fra-1 and Neogenin) 
and cell cycle-related genes (Id-1 and CDC25B) were studied 
more extensively for their protein expression by immuno- 
histochemistry. The two ESCC cell lines and their corre- 
sponding primary tissues, 61 primary ESCC resected spec- 
imens and 16 matching, morphologically normal, esophageal 
epithelium tissues were analyzed. The immunostaining re- 
sults showed that Fra-1, Neogenin, Id-1, and CDC25B were 
overexpressed in both ESCC cell lines and their correspond- 
ing primary tumors at the protein level, validating the mi- 
croarray findings. The results of the clinical specimens 
showed that the Fra-1 gene was overexpressed in ESCC 
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compared with normal esophageal epithelium in 53 of 61 
cases (87%), Neogenin in 57 of 61 cases (93%), Id-1 in 57 of 
61 cases (93%), and CDC25B in 48 of 61 cases (79%). 
Furthermore, the expression of Fra-1, Neogenin, and Id-1 in 
ESCC correlated with tumor differentiation. 

Conclusions: Overall, this study demonstrates that mul- 
tiple genes are differentially expressed in ESCC and pro- 
vides the first evidence that oncogenes Fra-1 and Neogenin 
and cell cycle-related genes Id-1 and CDC25B are overex- 
pressed in ESCC. 

Introduction 

Esophageal carcinoma is the ninth most common human 
cancer in the world and the second most common cancer in 
China (1). In Hong Kong, ESCC 3 accounts for -90% of esoph- 
ageal malignant tumors and is the sixth most common cause of 
cancer death (2). Despite advances in multimodality therapy, the 
overall 5-year survival rates for ESCCs still remain poor (3). 
The development of new treatment modalities, diagnostic tech- 
nologies, and preventive approaches will require a better under- 
standing of the molecular mechanisms underlying esophageal 
carcinogenesis. Although recent reports have documented alter- 
ations of a few oncogenes and tumor suppressor genes in ESCC, 
the molecular and genetic basis of esophageal carcinogenesis 
still remains largely unknown (4, 5). 

With the emerging technology of cDNA array hybridiza- 
tion, it is now possible to screen for alterations in the expression 
of many genes simultaneously (6-8). Because the development 
and progression of cancer are accompanied by complex changes 
in patterns of gene expression (9, 10), the cDNA array technol- 
ogy provides a very useful tool for studying these complex 
processes (6). In this study, we used cDNA expression array 
hybridization to examine the expression of 588 genes in two 
newly established ESCC cell lines (HKESC-1 and HKESC-2) 
and one corresponding, morphologically normal, esophageal 
epithelium tissue specimen collected prospectively from the 
HKESC-2-related patient. By comparing gene expression levels 
between normal esophageal epithelium and the ESCC cell lines, 
we were able to identify the differentially expressed transcripts 
in ESCC. Subsequent semiquantitative RT-PCR analyses vali- 



3 The abbreviations used are: ESCC, esophageal squamous cell carci- 
noma; Fra-1, fos-related antigen 1; CDC25B, cell division cycle 25B; 
Id- J, inhibitor of differentiation 1 (inhibitor of DNA binding 1); IH, 
immunohistochemistry; GAPDH, gIyceraldehyde-3-phosphate dehydro- 
genase; RT-PCR, reverse transcription-PCR; FPR], formyl peptide re- 
ceptor I ; RANTES, regulated on activation, normal T expressed, and 
secreted; AP-1, activator protein- 1; CDK, cyclin-dependent kinase; 
DAB, 3,3'-diaminobenzidine. 
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dated the cDNA array results. Expression of the protein products 
of four of these differentially expressed genes that belong to the 
categories of oncogenes/tumor suppressor genes (Fra-1 and 
Neogenin) and cell cycle-related genes (Id- 1 and CDC25B) was 
further evaluated by IH in a large series of ESCC tumor spec- 
imens. 

Materials and Methods 

Cell Culture and Tissue Specimen. Two human ESCC 
cell lines, HKESC-1 and HKESC-2, have been established 
recently in our laboratory. HKESC-1 has been reported previ- 
ously (11). Both cell lines were from Hong Kong Chinese 
patients with moderately differentiated ESCC; HKESC-I was 
from a 47-year-old man, whereas HKESC-2 was from a 46- 
year-old woman. The squamous epithelial nature of HKESC-2 
was confirmed by both electron microscopy (presence of tono- 
filaments and desmosomes) and immunohistochemical staining 
(positive for cytokeratins; data not shown). Both cell lines grew 
as adherent monolayers (II). HKESC-2 was maintained in the 
same conditions as HKESC-1 (11). Cells were harvested from 
passage 3 1 of HKESC-1 and passage 4 of HKESC-2 at 80-90% 
confluency, respectively. 

One morphologically normal, esophageal epithelium tissue 
specimen, collected prospectively from the HKESC-2-related 
patient, was used as a control for the array experiment. For 
obtaining high-purity normal esophageal epithelium tissue spec- 
imen, the morphologically normal esophageal epithelium at 
least 5 cm away from the tumor margin was carefully dissected 
out from other tissues of the freshly resected esophagectomy 
specimen from the HKESC-2-related patient and evaluated mi- 
croscopically. Unfortunately, the collected normal esophageal 
epithelium tissue from the HKESC-1 -related patient could not 
be used as a control because the specimen was too small and 
only a small amount of RNA could be extracted from it. 

cDNA Arrays, Probes, Hybridization, and Data Analy- 
sis. Atlas Human cDNA Expression Array membranes used in 
this study were purchased from Clontech (Palo Alto, CA). The 
membrane contained 10 ng of each gene-specific cDNA from 
588 known genes and 9 housekeeping gene fragments (Fig. 1). 
Several plasmid and bacteriophage DNAs and blank spots are 
also included as negative and blank controls to confirm hybrid- 
ization specificity. 4 

Total RNA was extracted using the Trizol reagent protocol 
(Life Technologies, Inc., Gaithersburg, MD) from the two 
ESCC cell lines (HKESC-1 and HKESC-2) and one correspond- 
ing, morphologically normal esophageal epithelium collected 
prospectively from the HKESC-2-related patient. mRNA was 
then isolated from the total RNA using the Straight A's mRNA 
Isolation System (Novagen, Madison, WI). The 32 P-labeled 
cDNA probes were generated by reverse transcription of 1 u-g of 
mRNA of each sample in the presence of [a- 32 P]dATP. Equal 
amounts of cDNA probes (3 X 10 6 cpm/uJ) from the ESCC cell 



lines and normal esophageal epithelium were then hybridized to 
separate Atlas Human cDNA array membranes for 24 h at 42°C 
and washed according to the supplier's instructions. The array 
membranes were then exposed to X-ray film at — 70°C for 2-5 
days. Autoradiographic intensity was analyzed using Atlaslm- 
age analysis software (version 1.01; Clontech). The signal in- 
tensities were normalized by comparing the expression of 
housekeeping genes GAPDH (G12) and HLA-C (G14): 



Intensity ratio = 



Adjusted intensity on array 
- HKESC-1 or HKESC - 2 
Adjusted intensity on array - normal 



X Normalization coefficient 



Adjusted intensity = Intensity - Background 
Normalization coefficient 

(Adjusted intensity GAPDH on array - normal \ 
Adjusted intensity GAPDH on array j 
- HKESC-1 or HKESC-2 / 



[Adjusted intensity HLA - Con array - normal \ 
Adjusted intensity HLA — Con array J 
- HKESC-1 or HKESC-2 / 



-s- 2 



4 A complete list of the 588 genes with array positions and GenBank 
accession numbers of the Atlas Human Expression Array used here can 
also be accessed through the web site http://www.clontech.com/clon- 
tech/APR97UPD/Atlas!ist.htmI. 



Genes were considered to be up-regulated when the intensity 
ratio between expression in the ESCC cell lines compared with 
normal esophageal epithelium was 2-fold or greater. Genes were 
labeled as down-regulated when the ratio between normal and 
ESCC cell lines was 2-fold or higher. 

RT-PCR. cDNA was generated using 1 \xg of total RNA 
from the two ESCC cell lines (HKESC-1 and HKESC-2) and 
one corresponding, morphologically normal, esophageal epithe- 
lium collected prospectively from the HKESC-2-related patient 
as template and 2.5 mM Oligo d(T) 16 primers in a 20-p.l reaction 
mixture, and the reverse transcription was carried out at 42°C 
for 1 h, followed by 95°C for 10 min using the GeneAmp RNA 
PCR Core kit (Perkin-EImer, Branchburg, NJ). Two p,I of 
cDNA were amplified in a 25-u.l PCR reaction mixture contain- 
ing IX PCR buffer, 1.9 or 2.4 or 2.9 mM MgCl 2 , 0.5 p-M 
primers, 0.18 mM deoxynucleotides triphosphates, 1 unit of 
AmpIiTaq Gold DNA polymerase with hot-start PCR as fol- 
lows: 95°C for 10 min, followed by 25-35 cycles of 1 min 
denaturation at 94°C, 1 min annealing at 60°C (for primers of 
cyclin Dl, 1d-l, CDC25B, FPR1, RANTES, and GAPDH) or 
62°C (for primers of Fra-1) or 65°C (for primers of Neogenin), 
1 min extension at 72°C. Finally, PCR products were fully 
extended by incubating at 72°C for 10 min. The PCR reagents 
were purchased from Perkin-EImer. 

The sequences of gene specific primers for RT-PCR were 
the same as those of cDNA array (data not shown because of the 
copyright agreement by Clontech, Palo Alto, CA), except for the 
primers specific for Fra-1, which were the same as described 
before (12). All of the primers were synthesized by Integrated 
DNA Technologies, Inc. (Coralville, IA). The cycle number was 
optimized for each gene-specific primer pair to ensure that 
amplification was in the linear range and the results were 
semiquantitative. Twelve u-1 of PCR product were visualized by 
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Fig. 1 A-C, gene expression profiles of two human ESCC cell lines 
HKESC-1 (A) and HKESC-2 (B) and one morphologically normal 
esophageal epithelium (Q from the HKESC-2-related patient using the 
Atlas Human cDNA Expression Array. Some of the differentially ex- 
pressed genes are indicated: 1, c-myc (Ala); 2, Fra-1 (A4f); 3, Neogenin 



electrophoresis on a 2% agarose gel stained with ethidium 
bromide and quantitated by densitometry using a dual-intensity 
transilluminator equipped with Gelworks ID Intermediate soft- 
ware (version 2.51). 

Collection of Tissues and Clinicopathological Data. 
The tissues were obtained from 61 (50 men and 1 1 women) 
patients with ESCC resected between 1996 and 1998 in Queen 
Mary Hospital, The University of Hong Kong. The patients' 
ages ranged from 41 to 83 years, with a mean age of 65 years. 
The specimens were dissected and examined in the fresh state. 
Representative tissue specimens from tumors and matching nor- 
mal esophageal epithelium tissues were snap-frozen in liquid 
nitrogen and stored at — 80°C. Other representative blocks were 
taken and processed in paraffin for histological examination. 
The carcinomas were found in the upper (n = 10; 16%), middle 
(n = 35; 57%), and lower (n = 16; 26%) third of the esophagus. 
The median length of the tumors was 5.5 cm (range, 1-11). The 
histology of the carcinomas was reviewed according to the 
criteria described previously (13). The ESCC tumors were well 
differentiated in 20 (33%) cases, moderately differentiated in 29 
(48%), and poorly differentiated in 12 (20%). The carcinomas 
were staged according to the Tumor-Node-Metastasis classifi- 
cation (14). Many tumors were stage III (n = 35, 57%) or II 
(n — 23, 38%); of the remainder, one was stage I and two were 
stage IV. 

Immunohistochemistry. Expression of Fra-1, Neoge- 
nin, Id- 1, and CDC25B was investigated by the streptavidin- 
biotin-peroxidase complex method. Briefly, 6-jxm frozen sec- 
tions were cut from two pellets harvested from cultured cell 
lines HKESC-1 and HKESC-2, the cell lines' corresponding 
primary tissues, 61 primary ESCC tumors, and 16 matching, 
morphologically normal, esophageal epithelium specimens. Af- 
ter endogenous peroxidase activity was quenched and nonspe- 
cific binding was blocked, polyclonal rabbit anti-Fra-1, goat 
anti-Neogenin, rabbit anti-Id- 1, and goat anti-CDC25B antibod- 
ies (Santa Cruz Biotechnology, Santa Cruz, CA) were incubated 
at 4°C overnight at a dilution of 1:100 for Fra-1, 1:40 for 
Neogenin and ld-1, and 1:30 for CDC25B, respectively. The 
secondary antibody was biotinylated swine anti-rabbit (for 
Fra-1 and ld-1) or rabbit anti-goat (for Neogenin and CDC25B) 
antibody (DAKO, Glostrup, Denmark) used at a dilution of 
1 :200 for 30 min at 37°C. After washing, sections were incu- 
bated with StreptABComplex/horseradish peroxidase (DAKO; 
1:100 dilution) for 30 min at 37°C. Negative controls were 
performed by replacing the primary antibody by normal serum. 



(A4n); 4, ld-1 (A5e); and 5, CDC25B (A7m). D, schematic diagram of 
Atlas Human cDNA Expression Array. The array contains 588 human 
genes spotted in duplicate and divided into six functional categories 
{Quadrants A-F). Three blank (Gl, G8, and G15) and nine negative 
(G2-4, G9-11, and G 16-1 8) controls are included to confirm hybrid- 
ization specificity. Nine housekeeping genes (G5-7, G12-I4, and G19- 
21) are also included in the array for normalizing mRNA abundance. 
Genomic DNA spots (dark dots) serve as orientation marks to facilitate 
in the determination of the coordinates of hybridization signals. A 
complete gene list with array coordinates and GenBank accession num- 
bers is available at world wide web site http://www.clontech.com/ 
clontech/APR97 UPD/Atlaslist.html. 
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Each section was independently assessed by two histopatholo- 
gists without prior knowledge of the patients' other data. Scor- 
ing was based on the percentage of positive cells. The staining 
was identified as: -, no expression; +, <10% of cells were 
stained; + + , 10-50% of cells stained; and + ++, >50% of cells 
stained. From ++ to + + + was defined as overexpression. 

Statistical Analysis. Comparisons between groups were 
performed using the \ 2 test and / test when appropriate. P < 
0.05 was used to determine statistical significance. All statistical 
tests were performed with the GraphPad Prism software version 
3.0 (GraphPad Software, Inc., San Diego, CA). 

Results and Discussion 

In this study, we first used cDNA expression array hybrid- 
ization to identify genes that were differentially expressed in 
ESCC compared with normal esophageal epithelium. Two 
newly established ESCC cell lines in our laboratory were se- 
lected for cDNA array analysis to assure large quantities of 
high-purity tumor mRNA. The comparison of the autoradio- 
graphic intensities between ESCC cell lines and normal esoph- 
ageal epithelium showed that 65 and 59 genes were up-regulated 
2-fold or higher and 11 and 21 genes were down-regulated 
2-fold or more in HKESC-1 and HKESC-2, respectively. 
Among these differentially expressed genes, 53 genes were 
up-regulated and 8 genes were down-regulated in both cell lines 
(Fig. 1 and Table 1). No signals were visible in the three blank 
spots (Gl, G8, and G15) and nine negative control spots (G2-4, 
G9-1 1, and G16-18; Fig. 1), indicating that the cDNA array 
hybridization was highly specific. Among the 61 differentially 
expressed genes in both cell lines, 49 genes such as Fra-J, 
Neogenin, Id- 1, and CDC25B genes were identified as differ- 
entially expressed in ESCC for the first time; 12 other differ- 
entially expressed genes have been described to be overex- 
pressed in ESCC previously. The genes overexpressed in both of 
the ESCC cell lines belong to the categories of oncogenes/tumor 
suppressor genes, cell cycle-related genes, genes for DNA syn- 
thesis, DNA binding genes, or apoptosis-related genes (Table 1). 
The 8 genes that were down-regulated in both ESCC cell lines 
comprised genes for signal transduction (guanine nucleotide 
regulatory protein NET}, protein kinase C-fJ //, cAMP-depend- 
ent protein kinase catalytic d-subunit and EPLG3), genes for 
signaling proteins (RANTES protein T-cell specific, Somatome- 
din A, and FPR1), and the gene for MAL protein. These findings 
demonstrated that multiple genes are differentially expressed in 
ESCC at mRNA level. 

To further validate the cDNA array approach, we per- 
formed semiquantitative RT-PCR to analyze the expression 
levels of 8 genes, cyclin Dl, Fra-J, Neogenin, Id- J, CDC25B, 
FPR1, RANTES, and GAPDH. The results of RT-PCR analysis 
(Fig. 2) were consistent with the expression profiles obtained 
through cDNA array hybridization (Fig. 1). 

Genes that belong to the categories of oncogenes/tumor 
suppressor genes and cell cycle-related genes are often impli- 
cated in the pathogenesis of various cancers (4, 5, 15). Signif- 
icantly, a number of the differentially expressed genes identified 
by cDNA array hybridization in both ESCC cell lines belong to 
these categories. Four of these differentially expressed genes 
that were identified for the first time to be overexpressed in 



ESCC in this study, the oncogenes Fra-J and Neogenin and the 
cell cycle related genes Id- J and CDC25B, were selected for 
more detailed study for their protein expression in a large series 
of ESCC tumor specimens by IH. Moreover, these genes have 
been reported to be overexpressed in other tumor cell lines or 
primary tumors (16-24). The other consideration for selecting 
these particular genes for more extensive study was that the 
suitable antibodies of these genes were commercially available 
for the IH studies. The protein expression of these four genes 
was investigated in the two ESCC cell lines and their corre- 
sponding primary tissues, 61 primary ESCC tumors, and 16 
matching, morphologically normal, esophageal epithelium spec- 
imens. The results of immunostaining are summarized in Tables 
2 and 3 and are shown in Fig. 3. The protein products of Fra-I, 
Neogenin, Id- J, and CDC25B genes were found to be overex- 
pressed in both the ESCC cell lines and their corresponding 
primary tumors (Table 2), validating the cDNA array results. 

Fra-I is one component of the AP-1 complex (25). The 
AP-1 components are considered to play key roles in signal 
transduction pathways involved in complex cellular growth, 
differentiation, and tumorigenesis (16). Previous studies indi- 
cated that increased AP-1 activity is a necessary event in the 
transformation of mouse epidermal cells (26, 27). Fra-J over- 
expression has been found in kidney and thyroid cancer (16, 17). 
These observations suggest that Fra-I overexpression might 
play an important role in malignant transformation of epithelial 
cells. In the present work, Fra-J mRNA overexpression was 
detected in both ESCC cell lines by cDNA array analysis (Fig. 
1) and RT-PCR (Fig. 25). Also, the majority of ESCC tumors 
(53 of 61, 87%; Table 3; Fig. 3B) had enhanced expression of 
Fra-L Fra-l protein expression was localized in the nuclei of 
ESCC tumor cells (Fig. 3B). In contrast, morphologically nor- 
mal, esophageal epithelium tissues showed low expression of 
Fra-l. The expression of Fra-l was often focal in morphologi- 
cally normal esophageal epithelium and always restricted to the 
basal cell layer (Fig. 3A). The well or moderately differentiated 
ESCC showed more intense expression of Fra-l than poorly 
differentiated ones (P < 0.0001; Table 3). 

Neogenin encodes a 1461 -amino acid protein with 50% 
amino acid identity to DCC {deleted in colorectal cancer, Ref. 
18). It has been suggested to play an integral role in regulating 
differentiation and/or cell migration events within many embry- 
onic and adult tissues (28). Neogenin expression has been de- 
tected at low levels in many adult tissues but not including 
esophagus (18). Overexpression of Neogenin has been observed 
in a wide variety of human cancer cell lines from cancers of 
breast, pancreas, brain, cervix, colon, and rectum (18). How- 
ever, there is no information about the status of Neogenin 
expression in human primary cancers including esophageal can- 
cer. In the current study, our cDNA array (Fig. 1) and RT-PCR 
(Fig. IB) results showed that Neogenin mRNA was overex- 
pressed in both ESCC cell lines. Neogenin protein overexpres- 
sion was noted in 93% (57 of 61) of ESCCs (Table 3; Fig. 3D). 
The expression was localized in the cytoplasm of tumor cells 
(Fig. 3D). In contrast, the expression of Neogenin protein in 
morphologically normal esophageal epithelium was negative or 
negligible and was restricted to the highly proliferative basal 
cells (Fig. 3Q. The well or moderately differentiated ESCC 
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Table 1 List of differentially expressed genes in the ESCC cell lines HKESC-1 and HKESC-2 when compared with one corresponding, 
morphologically normal, esophageal epithelium tissue specimen (N) from the HKESC-2-related patient using cDNA expression array 

Intensity Ratio 



Location 



Name of gene 



HKESC-I/N 



HKESC-2/N 



Genes up-regulated in both ESCC cell lines 
Oncogene/Tumor suppressor genes 



Ala 
A2b 
A3b 
A3i 
A3k 
A4b 
A4c 
A4f 
A4g 
A4h 
A4j 
A4k 
A41 
A 4 m 
A4n 



c-myc 

1GFBP-2 

Snon 

rhoA (MDR protein) 

DCC 

APC 

BRCA2 

Fra-1 

Ezrin 

JVN-D 

PEPI 

EBI 

C-CBL 

Smadl 

Neogenin 



Cell cycle-related genes 

A5e Id-I 

A5g P58/GTAJ 

A6g Cyclin DI 

A61 Cyclin BI 

A6m Cyclin E 

A7b Cyclin G2 

A7d p35 

A71 C-l 

A7m CDC25B 
Apoptosis-associated genes 

C I i Adenosine A J receptor 

C41 Apopain 
Genes for DNA synthesis/repair/recombination proteins 

C6d XRCC1 

C61 DM4 Topoisomerase II 

C7n Dnase X 
Genes for DNA binding/transcription factors 

Die CCAT-binding protein 

Did Id-3 

Die BTEB2 

Dig Id-2 

D1I TAX 

Din CNBP 

D2a C CAA T displacement protein 

D2c APRF 

D2d hSNF2b 

D2f TAXREB67 

D2i TCF5 

D3a hSNF2a 

D3b DBJ 

D3c D-binding protein 

D3g PAX-8 

D3j PI 5 subunit 

D3k Guanine nucleotide-binding protein G-S 

D4c AP-2 

D4j NF-EI 

D5k PAX3 

D7k TAFII3I 
Genes for signal proteins 

F5a NGF-2 

F5b MIP2a 

F5f IL-8 
Genes down-regulated in both ESCC cell lines 

Gene for iron channel/transport protein 

Bib MAL protein 
Genes for signal transduction 

B4g Guanine nucleotide regulatory protein NET1 

B5j Protein kinase c-f3 // 

B6b c AMP -dependent protein kinase a-subunit 

B6n EPLG3 
Genes for cell signaling proteins 

Fla Somatomedin A 

F 1 k FMLP- related receptor I 

F2j RANTES protein T-cell specific 



2.5 
6.9 
7.2 
2.8 
5.1 
2.6 

30029/0 
10.8 

21936/0 
35.4 
3.6 
2.6 
2.7 
3.2 
2.4 

3.4 
4.0 
2.1 
2.4 
3.5 
2.4 
6.9 
2.6 
2.0 

22.2 
8.7 

3.2 
14759/0 
5.1 

9.9 
6.5 
7.4 
9.8 
3.0 
4.3 
4.2 
33987/0 

42.2 
39440/0 
37478/0 
7.1 
316.5 
37796/0 

13.6 
38725/0 
33.0 
7.1 
34506/0 
24.8 
7.0 

16.7 
39223/0 
39239/0 



1/3.3 

0/28564 
0/29588 
0/41060 
0/22304 

0/19560 
0/29220 
0/36868 



3.0 
6.0 
6.0 
3.9 
3 

3.9 
42730/0 

7.8 
36475/0 
60.6 
4.8 
3.0 
2.2 
3.0 
3.2 

3.7 
2.2 
2.3 
3.2 
3.3 
3.1 
7.2 
2.6 
2.5 

5.9 
13.7 

2.9 
22201/0 
4.4 

8.8 
6.6 
9.0 
7.6 
4.4 
2.1 
4.5 

2901 1/0 
40.7 

49383/0 

23080/0 
4.2 
225.0 

21621/0 
7.2 

40864/0 
49.2 
4.9 

15924/0 
10.6 
4.8 

4.6 
19827/0 
1 1638/0 



0/11989 

1/23.7 
0/29588 
1/304 
1/2.5 

0/19560 
0/29220 
1/3.2 
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Fig. 2 RT-PCR analysis of cyclin Dl, Fra-1, 
Neogenin, Id-1, CDC25B, FPR1, RANTES, and 
GAPDH genes in ESCC cell lines HKESC-1 and 
HKESC-2 and one corresponding, morphologi- 
cally normal epithelium (Normal) from the 
HKESC-2-related patient. A, determination of op- 
timal number of PCR cycles for different gene- 
specific primer pairs. mRNA from HKESC-1 was 
used to determine the optimal number of PCR 
cycles for genes cyclin Dl, Fra-1, Neogenin, ld-l, 
CDC25B, and GAPDH. mRNA from the normal 
esophageal epithelium was used to determine the 
optimal number of PCR cycles for genes FPR1 
and RANTES. B, expression of cyclin Dl (25 cy- 
cles), Fra-1 (32 cycles), Neogenin (30 cycles), 
ld-l (25 cycles), CDC25B (28 cycles), FPR1 (35 
cycles), RANTES (28 cycles), and GAPDH (25 
cycles) genes in two ESCC cell lines HKESC-1 
and HKESC-2 and one corresponding, morpholog- 
ically normal esophageal epithelium (Normal) 
from the HKESC-2-related patient. 




Table 2 Summary of immunohistochemical staining results in ESCC cell lines and their corresponding primary tissue specimens 







Cell lines 




Primary tissues" 






HKESC-1 


HKESC-2 




N, 




N 2 


Fra-1 




+ + + 


+ + + 


+ J\ 


+ + 


+ 


Neogenin 


+ + + 


+ + + 


+ + + 




+ + 




Id-1 


+ + + 


+ + + 


+ + + 


+ + 


+ + 




CDC25B 


+ + + 


+ + + 


+ + -f 






+ 



a T, ESCC tumor; N, morphologically normal esophageal epithelium. 

b Expression: no expression; + , <10% cells positive; + + , >10% and <50% cells positive; + + +, >50% cells positive. ++ to + + + was 
considered as overexpression. 



showed more intense expression of Neogenin than poorly dif- 
ferentiated ones (P = 0.0047; Table 3). 

ld-l is a cell cycle-related gene that encodes a helix-loop- 
helix protein. Id-1 plays an important role not only in suppress- 
ing cellular differentiation but also in enhancing cellular prolif- 
eration (29, 30). Generally, Id-1 protein is highly expressed in 
growing cells, and its expression is down-regulated upon dif- 
ferentiation in many cell types. Although Id-1 is expressed in a 
variety of fetal tissues and overexpressed in tumors from brain 
and lung (19), its expression in ESCC is unknown. In this study, 



we observed that Id-1 mRNA was overexpressed in both ESCC 
cell lines (Figs. 1 and IB). Also, Id-1 protein overexpression 
was frequent in human primary ESCC tumors (57 of 61, 93%; 
Table 3; Fig. 3F). The Id-1 protein was localized in the cyto- 
plasm of tumor cells (Fig. 3F). In contrast, the expression of 
Id-1 protein in morphologically normal esophageal epithelium 
was either negative or negligible and was restricted to the basal 
and parabasal cells (Fig. 3£). The well or moderately differen- 
tiated ESCC showed more intense expression of Id-1 than 
poorly differentiated ones (P = 0.0156; Table 3). 



Clinical Cancer Research 2219 



Fig. 3 Photomicrographs of Fra-1, 
neogenin, Id*], and CDC25B ex- 
pression by IH in morphologically 
normal esophageal epithelium and 
ESCC. A, Fra-1 IH in morphologi- 
cally normal esophageal epithelium 
showing that Fra-1 expression was 
restricted to the basal cell layer (ar- 
row); DAB X160. B, Fra-1 IH in 
ESCC showing that the nuclei of 
tumor cells are strongly positive for 
Fra-1 \ DAB x 400. C Neogenin IH 
in morphologically normal esopha- 
geal epithelium showing that Neoge- 
nin expression was restricted to the 
basal cell layer (arrow); DAB 
X330. D, Neogenin IH in ESCC 
showing Neogenin expression in the 
cytoplasm of tumor cells; DAB 
X500. E, ld-1 IH in morphologi- 
cally normal esophageal epithelium 
showing ld-1 expression was re- 
stricted to the basal (arrow) and 
parabasal (arrowhead) cell layers; 
DAB X330. F, ld-1 IH in ESCC 
showing ld-1 expression in the cy- 
toplasm of tumor cells; DAB X500. 
G, CDC25B IH in morphologically 
normal esophageal epithelium show- 
ing lack of immunoreactivity; DAB X 
330. H, CDC25B IH in ESCC show- 
ing CDC25B expression in the nuclei 
of most tumor cells; DAB X500. 
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CDC25B is a cell cycle- related gene. Its product is a 
phosphatase that catalyzes the removal of inhibitory phosphate 
from the CDK family of proteins (31). CDC25B can dephos- 
phorylate threonine 14, tyrosine 15, or both on CDKs and 
activate cycIin/CDK complexes to stimulate cell proliferation 
(32). In vitro transforming experiments have demonstrated that 
CDC25B is also a potential oncogene (20). Overexpression of 



CDC25B has been found in cancers arising from breast (20), 
stomach (21), lung (22), and head and neck (23), and in non- 
Hodgkin's lymphoma (24). In this study, we demonstrated that 
the mRNA of CDC25B was highly expressed in both ESCC cell 
lines by cDNA array (Fig. 1) and RT-PCR (Fig. IB). Further- 
more, CDC25B was overexpressed in 79% (48 of 61) primary 
ESCC tumors by IH (Table 3; Fig. 3H). CDC25B protein 
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Table 3 Summary of IH staining results in clinical ESCC tumors and normal esophageal epithelium tissues 

Fra-1 Neogenin Id- 1 CDC25B 



Diagnosis -° + ++ + + + P - + ++ + + + P - + + + + + + P - + + + + + + P 

ssii;i?6., j *4 2 5 3 ; <^>r? ; 5 s <°™h ; 4 s ««««,; 2 ,s 2 s «««»■ 

Well (n= 20) 00911 1 0 2 17 013 16 2279 

Moderate (n = 29) 1 0 11 17 <0.0001 0 0 1 28 0.0047 0 0 3 26 0.0156 6 1 10 12 0.5720 

Poor (ft = 12) 3 4 2 3 2 14 5 3 0 2 7 2 0 2 8 

° Expression: no expression; + , <10% celts positive; ++, >10% and <50% cells positive; + ++, >50% cells positive. ++ to + + + was 
considered as overexpression. 



expression was localized mainly in the nuclei of tumor cells 
(Fig. 3H). On the other hand, the expression of CDC25B in 
morphologically normal esophageal epithelium tissues was ei- 
ther negative or very weak (Fig. 3G). In the case of CDC25B, 
there was no correlation between gene expression and ESCC 
differentiation (P = 0.5720; Table 3). 

In summary, ail four of the genes selected for further study 
demonstrated a significantly higher incidence of overexpression 
in primary ESCCs than morphologically normal esophageal 
epithelium tissues (P < 0.0001; Table 3). Furthermore, three of 
them, Fra-I, Neogenin, and ld-1 were more highly expressed in 
tumors with greater differentiation. CDC25B did not demon- 
strate this correlation (Table 3). The expression of these genes 
did not correlate to age at presentation or gender of patients or 
tumor site, size, or stage. The differentiation of squamous cell 
carcinoma bears no relationship with the stage of the tumor (4). 
In this study, the expression of Fra-I, Neogenin, and ld-1 was 
more often noted in the wel I/moderately differentiated squa- 
mous cell carcinoma. This is consistent with the theory that 
poorly differentiated squamous cell carcinoma arises at the early 
stage of carcinogenesis. In the later stages of tumor progression, 
the squamous cell carcinoma becomes more mature in appear- 
ance (well/moderately differentiated). 

Unfortunately, the complete follow-up data were available 
only for some of these patients. Nevertheless, all these patients 
died within 2 years of resection of the primary tumors. Also, the 
Fra-1, Neogenin, Id-], and CDC25B were highly expressed in 
squamous cell carcinomas. Thus, it is unlikely that the expres- 
sion of these genes acts as an independent prognostic factor in 
these tumors. 

Overall, our data demonstrate that multiple genes are dif- 
ferentially expressed in ESCC and show for the first time that 
oncogenes Fra-1 and Neogenin and cell cycle- related genes ld-1 
and CDC25B are overexpressed in ESCC. Additional studies are 
required to determine the roles of these and other differentially 
expressed genes in the molecular pathogenesis of ESCC. 
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Abstract 

The BMI-1 gene is a putative oncogene belonging to the Polycomb 
group family that cooperates with c-myc in the generation of mouse 
lymphomas and seems to participate in cell cycle regulation and senes- 
cence by acting as a transcriptional repressor of the INK4a/ARF locus. 
The BMI-1 gene has been located on chromosome 1 Op 13, a region involved 
in chromosomal translocations in infant leukemias, and amplified in 
occasional non-Hodgkin's lymphomas (NHLs) and solid tumors. To de- 
termine the possible alterations of this gene in human malignancies, we 
have examined 160 lymphoproliferative disorders, 13 myeloid leukemias, 
and 89 carcinomas by Southern blot analysis and detected BMI-1 gene 
amplification (3- to 7-fold) in 4 of 36 (11%) mantle cell lymphomas 
(MCLs) with no alterations in the INK4a/ARF locus. BMI-1 and pl6 INK4a 
mRNA and protein expression were also studied by real-time quantitative 
reverse transcription-PCR and Western blot, respectively, in a subset of 
NHLs. BMI-1 expression was significantly higher in chronic lymphocytic 
leukemia and MCL than in follicular lymphoma and large B cell lym- 
phoma. The four tumors with gene amplification showed significantly 
higher mRNA levels than other MCLs and NHLs with the BMI-1 gene in 
germline configuration. Five additional MCLs also showed very high 
mRNA levels without gene amplification. A good correlation between 
BMI-1 mRNA levels and protein expression was observed in all types of 
lymphomas. No relationship was detected between BMI-1 and pl6 INK4a 
mRNA levels. These findings suggest that BMI-1 gene alterations in 
human neoplasms are uncommon, but they may contribute to the patho- 
genesis in a subset of malignant lymphomas, particularly of mantle cell 
type. 

Introduction 

The BM-1 3 gene is a putative oncogene of the Polycomb group 
originally identified by retroviral insertional mutagenesis in E/x-c- 
myc transgenic mice infected with the Moloney murine leukemia 
virus (1, 2). These animals had a rapid development of pre-B cell 
lymphomas showing frequent proviral insertions near the BMI-1 gene. 
This integration resulted in BMI-1 overexpression suggesting a coop- 
erative effect between C-MYC and BMI-1 genes in the development of 
these tumors (3, 4). Recent studies have indicated that the BMI-1 gene 
may also participate in cell cycle control and senescence through the 
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INK4a/ARF locus by acting as an upstream negative regulator of 
pl6 iNK4a and pl 4/ p i9ARF gene express j on (5). The human BMI-1 

gene has been mapped to chromosome 10pl3 (6), a region involved in 
chromosomal translocations in infant leukemias (7) and rearrange- 
ments in malignant T cell lymphomas (8, 9). More recently, high-level 
DNA amplifications of this region have been found by comparative 
genomic hybridization in NHLs and solid tumors (10, 11). However, 
the possible implication of the BMI-1 gene in these alterations and its 
role in the pathogenesis of human tumors is not known. The aim of 
this study was to analyze the possible BMI-1 gene alterations and 
expression in a large series of human neoplasms and to determine the 
relationship with lNK4a/ARF locus aberrations. 

Materials and Methods 

Case Selection. A series of 262 human tumors, including 173 hematolog- 
ical malignancies and 89 carcinomas (Table 1), matched normal tissues from 
all carcinomas, 1 1 samples of normal peripheral mononuclear cells, and 5 
reactive lymph nodes and tonsils, were selected based on the availability of 
frozen samples for molecular analysis. 

DNA Extraction and Southern Blot Analysis. Genomic DNA was ob- 
tained using Proteinase K/RNase treatment. 15 >g were digested with EcoRl 
and //wdlll restriction enzymes (Life Technologies, Inc., Gaithersburg, MD), 
for Southern blot analysis and hybridized with a 1.5-kb Pstl fragment of the 
partial BMI-1 cDNA (6). 

RNA Extraction and Real-time Quantitative RT-PCR. Total RNA was 
obtained from 67 lymphoid neoplasms (10 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs) using guanidine/isothiocyanate extraction and cesium/chloride gradient 
centrifugation. One u.g of total RNA was transcribed into cDNA using 
MMLV-reverse transcriptase (Life Technologies, Inc.) and random hexamers, 
following manufacturer's directions. Sequences of the BMI-1 and the pi 6 
detection probes and primers were designed using the Primer Express program 
(Applied Biosystems, Foster City) as follows: BMI-1 sense, 5'-CTGGTTGC- 
CCATTGACAGC-3 ' ; BMI-1 antisense, 5 '-C AG AAAATG A ATGCG AG- 
CCA-3'; p!6 sense, 5 ' -C A ACGC ACCG A AT AGTT ACGG-3 ' ; pI6 antisense, 
5 '-AACTTCGTCCTCCAG AGTCGC-3 ' . The probes BMI-1, 5'-CAGCTC- 
GCTTC AAG ATGGCCGC-3 ' , and pI6, 5'-CGGAGGCCGATCCAGGTGG- 
GTA-3', were labeled with 6-carboxy-fluorescein as the reporter dye. The 
TaqMan-GAPDH Control Reagents (Applied Biosystems) were used to am- 
plify and detect the GAPDH gene, as recommended by the manufacturer. The 
quantitative assay amplified 1 /iml of cDNA in two to four replicates using the 
primers and probes described above and the standard master mix (Applied 
Biosystems). All reactions were performed in an ABI PRISM 7700 Sequence 
Detector System (Applied Biosystems). GAPDH, BMI-1, and p!6 ,NK4a ex- 
pression was related to a standard curve derived from serial dilutions of Raji 
cDNA. The RUs of BMI-1 and pl6 ,NK4a expression were defined as the 
mRNA levels of these genes normalized to the GADPH expression level in 
each case. 

Protein Analysis. Whole-cell protein extracts were obtained from addi- 
tional frozen tissue available in 31 cases (7 CLLs, 12 MCLs, 8 FLs, and 4 
LCLs), loaded onto a 10% SDS-polyacrylamide gel, and electroblotted to a 
nitrocellulose membrane (Amersham). Blocked membranes were incubated 
sequentially with the monoclonal antibody BMI-F6 (12), antimouse conju- 
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Table 1 Hematological malignancies and solid tumor samples analyzed for BMI-1 
gene alterations 



Tissue samples 


No. of cases 


Hematological malignancies 




Hodgkin s disease 


2 


B cell lymphoproliferative disorders 


14 


R-Anitf" lvmnhnhlastic leukemia 


CLL 


29 


ridiry veil icunciuia 


4 


FL 


15 


MCL 


36 


LCL 


40 


T" r><»11 lirmnh nnrn 1 i tfV»rti 1t\/f* Hicf»rHf*r<t 

1 lcii lympiiupi uiiivi aii vc uuuiuci i 




T.Arnt<» Ivmnhnhl astir Ipukemia. 


8 


T arcrp onnnhr cell leukemia 
L^aigC glaliuiai veil iGUltciilia 


4 


Peripheral T-cell lymphoma 


8 


Myeloproliferative disorders 




Acute myeloid leukemia 


7 


Chronic myeloid leukemia 


6 


Solid tumors 




Colon carcinoma 


26 


Breast carcinoma 


29 


Laryngeal squamous cell carcinoma 


34 


Total 


262 



gated to horseradish peroxidase (Amersham), and detected by enhanced chemi- 
luminescence (Amersham) according to the manufacturer's recommendations. 

Statistical Analysis. Because of the non-normal distribution of the samples 
and the small size of some subsets of tumors, the statistical evaluation was 
performed using nonparametric tests (SPSS, version 9.0). Comparison between 
mRNA expression levels in the different groups of NHLs was performed using 
the Kruskal-Wallis Test, with a P for significance set at 0.05. For differences 
between particular groups, the conservative Bonferroni procedure was per- 
formed, and the P was set at 0.005. The remaining statistical analyses were 
carried out using the Mann- Whitney nonparametric U test (significance, P 
<0.05). The comparison between BMI-1 and pl6 ,NIC4a quantitative mRNA 
levels was also performed using the Pearson's correlation coefficient. 

Results 

BMI^l Gene Amplification. The BMI-1 gene was examined by 
Southern blot in a large series of human tumors and normal samples 
(Table 1). The cDNA probe used in the study detected three EcoRl 
fragments of 7.3, 3.8, and 2.6 kb and three //mdlll fragments of 6.2, 
4, and 3.5 kb. BMI-1 gene amplification (3- to 7-fold) was detected in 
4 of 36 (1 1%) MCLs (Fig. 1). The amplifications were confirmed with 
both restriction enzymes. The amplified MCLs were two blastoid and 
two typical variants. No amplifications were observed in any of the 
solid tumors when compared with their respective matched non- 
neoplastic mucosa. No BMI-1 gene rearrangements were observed in 
any of the samples examined. 

BMI-1 mRNA Expression. To determine the BMI-1 expression 
pattern in NHL we analyzed BMI-1 mRNA levels by real-time quan- 
titative RT-PCR in 67 lymphomas (10 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs), including the four tumors with gene amplification. A distinct 
BMI-1 mRNA expression pattern was observed in the different types 
of lymphomas (Fig. 2; Kruskal-Wallis Test; P < 0.001). The BMI 
mRNA levels in CLLs (mean, 2.2 RU; SD, 1.3) and MCLs with no 
BMI-1 gene amplification (mean, 2.5 RU; SD, 2.3) were significantly 
higher than in FLs (mean, 0.9 RU; SD, 0.8) and LCLs (mean, 0.6 RU; 
SD, 0.4; Mann- Whitney nonparametric U test; P < 0.01). The 4 
MCLs with BMI-1 gene amplification showed significantly higher 
levels of expression than all other groups of tumors (mean, 5.1 RU; 
SD, 1.6; P < 0.005). In addition, five typical MCLs with no structural 
alterations of the gene also showed very high levels of BMI-1 mRNA 
expression ranging from 4 to 9.8 RU, similar to cases with gene 
amplification (Fig. 1A). 

BMI-1 Protein Expression. BMI-1 protein expression was exam- 
ined by Western blot in 31 tumors (7 CLLs; 12 MCLs, including two 



cases with BMI-1 gene amplification and 4 cases with mRNA over- 
expression and no structural alteration of the gene; 8 FLs, and 4 LCLs) 
in which additional frozen tissue was available. The monoclonal 
antibody against BMI-1 detected three closely migrating proteins of 
M r 45,000-48,000 (2). The two more slowly migrating bands prob- 
ably represent phosphorylated isoforms of the protein (12). The two 
MCLs with gene amplification and three of four cases with mRNA 
overexpression without amplification of the gene showed very high 
levels of protein expression. The remaining MCLs and CLLs showed 
intermediate levels of protein expression, whereas low- or no-expres- 
sion signals were detected in the LCLs and FLs included in the study 
(Fig. 3). These results indicate that BMI-1 protein expression in NHL 
is concordant with the mRNA levels observed by real-time quantita- 
tive RT-PCR. 

Relationship between BMI-1 and pl6 ,NK4a Gene Alterations. 

The INK4a/ARF locus has been recently identified as a downstream 
target of the transcriptional repressing activity of the BMI-1 gene, 
suggesting that this gene may contribute to human neoplasias with 
wild type INK4/ARF (5). Most of the lymphoproliferative disorders 
analyzed in the present study, including the four cases with BMI-1 
gene amplification, had been previously examined for p53 gene mu- 
tations and INK4a/ARF locus alterations, including gene deletions, 
mutations, hypermethylation, and expression (13, 14). The four MCLs 
with BMI-1 gene amplification and mRNA overexpression and the 
five tumors with BMI-1 mRNA overexpression with no structural 
alterations of the gene showed a wild-type configuration of the 
lNK4a/ARF locus (13). However, one case with BMI-1 gene ampli- 
fication and one case with mRNA overexpression with no alteration of 
the gene showed p53 gene mutations associated with allelic deletions. 

To determine the possible relationship between BMI-1 and 
pl6 iNK4a mRNA expression, pl6 ,NK4a mRNA levels were evaluated 
by real-time quantitative RT-PCR in 50 tumors (10 CLLs, 27 MCLs, 
and 13 LCLs), including 6 cases with alterations in the INK4a/ARF 
locus (2 MCLs and 1 LCL with /?/<5 INK4a gene deletion, 2 LCLs with 
p!6 promoter hypermethylation, and 1 CLL with p7f5 INK4a gene 
mutation), and the 4 lymphomas with BMI-1 amplification. Negative 
or negligible levels of pl6 INKL4a were observed in the 6 tumors with 
INK4a/ARF locus alterations. These cases were not included in the 
comparisons between BMI-1 and pl6 INK4a mRNA expression. The 
pl6 INK4a expression levels were relatively similar in the different 
types of tumors. Only LCLs tended to have lower levels of expression, 
but the differences did not reach statistical significance (Fig. IB). No 
differences were observed in the pl6 ,NK4a mRNA levels between 
tumors with BMI-1 gene amplification and overexpression and lym- 
phomas with germline configuration of the gene. 



CLL MCL MCL* LCL FL 




Fig. 1 . Southern blot analysis of BMI-1 gene. Four MCLs (MCL*) showed BMI- 1 gene 
amplification (3- to 7-fold) compared with non-neoplastic tissues (AO and other NHLs. No 
amplifications or gene rearrangements were detected in the remaining NHLs and carci- 
nomas included in the study. 
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Fig. 2. A, quantitative BMI-1 mRNA transcript analysis (median and range) using 
real-time RT-PCR in a series of NHLs. MCLs with BMJ-l gene amplification (MCL*) 
revealed significantly higher overall BMI-1 mRNA levels than all other types of NHLs, 
including MCLs with no structural alterations of the gene (P < 0.005). MCLs and CLLs 
expressed significantly higher levels than FLs and LCLs (P < 0.001). Results are depicted 
as the ratio of absolute BMI-1 :GADPH mRNA transcript numbers (RU). Bars, SD. B, 
quantitative pl6 ,NK4a mRNA transcript analysis (median and range) using real-time 
RT-PCR in a series of NHLs. Expression levels were relatively similar in the different 
types of tumors. Results are depicted as the ratio of absolute pl6 !NK4a :GADPH mRNA 
transcript numbers (RU). Bars, SD. 



Discussion 

In the present study, we have examined a large series of human 
tumors for the presence of gene alterations and mRNA expression of 
the BMI-1 gene. Gene amplification was identified in four MCLs. 
These tumors showed significantly higher levels of mRNA and pro- 
tein expression compared with other lymphomas with BMI-1 in germ- 
line configuration. BMI-1 expression levels were also highly up- 
regulated in a subset of MCLs with no apparent structural alterations 
of the gene. No alterations were detected in any of the different types 
of carcinomas included in the study. BMI-1 is considered an oncogene 
belonging to the Polycomb group family of genes. These proteins 
mainly act as transcriptional regulators, controlling specific target 
genes involved in development, cell differentiation, proliferation, and 
senescence. Different studies have shown the implication of BMI-1 
overexpression in the development of lymphomas in murine and 
feline animal models (3, 4). The findings of the present study indicate 



for the first time that BMI-1 gene alterations in human neoplasms are 
an uncommon phenomenon, but they seem to occur mainly in a subset 
of NHLs, particularly of mantle cell type. 

The human BMI-1 gene has been mapped to chromosome 10pl3. 
High-level DNA amplifications and gains in this region have been 
identified by comparative genomic hybridization in occasional solid 
tumors and NHLs (10, 11). Different chromosomal translocations 
involving the 10pl3 region have also been identified in infant leuke- 
mias and T cell lymphoproliferative disorders (7, 8, 15). Most acute 
leukemias with this chromosomal alteration occur in children <12 
months of age, whereas it seems to be extremely rare in adults. 1 Op 
translocations in T-cell lymphoproliferative disorders have been ob- 
served mainly in adult T cell leukemia/lymphomas and occasional 
cutaneous T cell lymphomas. In our study, we did not observe BMI-1 
rearrangements or amplifications in any of the acute leukemias or T 
cell lymphomas. However, all of the acute leukemias in this study 
were diagnosed in patients over 16 years, and no adult T cell leuke- 
mia/lymphomas or cutaneous lymphomas could be included in the 
series. Similarly, high-level DNA amplifications at the 10pl3 region 
have been detected in head and neck carcinomas and other solid 
tumors. Although we found no evidence for BMI-1 gene rearrange- 
ments or amplifications in a substantial set of carcinomas, this does 
not exclude the possibility of increased gene expression or protein 
levels in these tumors. Additional studies are required to elucidate the 
possible involvement of BMI-1 in these particular groups of human 
neoplasms. 

In human hematopoietic cells, BMI-1 is preferentially expressed in 
primitive CD34+ bone marrow cells, whereas it is negative or very 
low in more mature CD34- cells (16). In peripheral lymphocytes, and 
particularly in follicular B cells, BMI-1 protein expression has been 
detected in resting cells of the mantle zone, whereas it is down- 
regulated in proliferating germinal center cells (17, 18). These obser- 
vations indicate that BMI-1 expression in normal hematopoietic cells 
is tightly regulated in relation with cell differentiation in bone marrow 
and antigen-specific response in peripheral lymphocytes. BMI-1 ex- 
pression in human tumors has not been examined previously. In this 
study, we have demonstrated that BMI-1 mRNA and protein expres- 
sion show a distinct pattern in different types of lymphomas. Thus, 
BMI-1 levels were low in LCLs and FLs and significantly higher in 
MCLs and CLLs. These findings suggest that BMI-1 expression 
patterns in B cell lymphomas maintain in part the expression profile 
of their normal cell counterparts; because FLs and at least a subgroup 
of LCLs are considered lymphomas derived from follicular germinal 
center cells, whereas MCLs and CLLs are tumors mainly derived from 
naive pregerminal center cells. However, the four MCLs with BMI-1 
gene amplification expressed significantly higher mRNA levels than 
all other tumors. In addition, five MCLs with no structural alterations 
of the gene showed high mRNA levels similar to those observed in 
tumors with BMI-1 gene amplification, suggesting that other mecha- 
nisms may be involved in up-regulation of the gene in these lympho- 
mas. Different studies using animal models have shown a dose- 
dependent effect of BMI-1 gene expression on skeleton development 
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Fig. 3. Western blot analysis of BMI-1 protein in NHLs. The amplified MCL (17624) 
showed the highest BMI-1 protein levels, whereas other MCLs and CLLs had intermediate 
levels of expression. Very low or negative signal was observed in FLs and LCLs. 
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and lymphomagenesis (1,3). These observations suggest that the high 
mRNA and protein levels detected in a subset of MCLs may play a 
role in the pathogenesis of these neoplasms. 

Recent studies have identified the INK4/ARF locus as a down- 
stream target of the BMI-1 transcriptional repressor activity, suggest- 
ing that BMI-1 overexpression may contribute to human neoplasias 
that retain the wild-type INK4a/ARF locus (5). Interestingly, in our 
study, BMI-1 amplification and overexpression appeared in tumors 
with no alterations in p!6 WK4a and p!4 ARF genes. However, we could 
not detect differences in the expression levels of pl6 ,NK4a in tumors 
with and without BMI-1 gene alterations. The reasons for this apparent 
discrepancy with experimental observations are not clear. One possi- 
bility may be that genes other than INK4a/ARF are the main targets of 
BMI-1 repressor activity in these tumors. Particularly, different genes 
of the HOX family are regulated by BMI-1 and may also be involved 
in lymphomagenesis (19, 20). 

In conclusion, the findings of this study indicate that BMI-1 gene 
expression is differentially regulated in B cell lymphomas. Alterations 
of the gene seem to be an uncommon phenomenon in human neo- 
plasms, but they may contribute to the pathogenesis in a subset of 
MCLs. Although, BMI-1 gene alterations occurred in tumors with 
wild-type INK4a/ARF locus, the possible cooperation between these 
genes and the oncogenic mechanisms of BMI-1 in human neoplasms 
require additional analysis. 
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Id proteins antagonize basic helix-loop-helix pro- 
teins, inhibit differentiation, and enhance cell prolif- 
eration. In this study we compared the expression of 
Id-1, Id-2, and Id-3 in the normal pancreas, in pan- 
creatic cancer, and in chronic pancreatitis (CP). 
Northern blot analysis demonstrated that all three Id 
mRNA species were expressed at high levels in pan- 
creatic cancer samples by comparison with normal or 
CP samples. Pancreatic cancer cell lines frequently 
coexpressed all three Ids, exhibiting a good correla- 
tion between Id mRNA and protein levels, as deter- 
mined by immunoblotting with highly specific anti-Id 
antibodies. Immunohistochemistry using these anti- 
bodies demonstrated the presence of faint Id-1 and 
Id-2 immunostaining in pancreatic ductal cells in the 
normal pancreas, whereas Id-3 immunoreactivity 
ranged from weak to strong. In the cancer tissues, 
many of the cancer cells exhibited abundant Id-1, 
Id-2, and Id-3 immunoreactivity. Scoring on the basis 
of percentage of positive cells and intensity of immu- 
nostaining indicated that Id-1 and Id-2 were increased 
significantly in the cancer cells by comparison with 
the respective controls. Mild to moderate Id immuno- 
reactivity was also seen in the ductal cells in the 
CP-like areas adjacent to these cells and in the ductal 
cells of small and interlobular ducts in CP. In con- 
trast, in dysplastic and atypical papillary ducts in CP, 
Id-1 and Id-2 immunoreactivity was as significantly 
elevated as in the cancer cells. These findings suggest 
that increased Id expression may be associated with 
enhanced proliferative potential of pancreatic cancer 
cells and of proliferating or dysplastic ductal cells in 
CP. (Am J Pathol 199% 155:815-822) 



Basic helix-loop-helix (bHLH) proteins play an important 
role as transcription factors in cellular development, pro- 
liferation, and differentiation. 1 * 2 The basic domain of the 
bHLHs is required for binding to an E-box DNA se- 
quence, thus promoting transcription of specific target 
genes. The HLH domain promotes dimer formation with 
various members of the bHLH protein family. 1,2 Ho- 
modimers of the class B family of bHLH proteins, includ- 
ing MyoD, NeuroD, and numerous other proteins, are 
known to activate tissue-specific genes. 3 ' 5 These tissue- 
specific bHLHs typically form heterodimers with widely 
expressed class A bHLHs, which include proteins en- 
coded by E2A, E2-2, HEB, and other genes (also termed 
E-proteins) 6 " 9 These heterodimers activate transcription 
of genes that are associated with differentiation. 

Id genes encode a family of four HLH proteins that lack 
the basic DNA binding domain. 1 ' 10 They act as dominant- 
negative HLH proteins by forming high affinity het- 
erodimers with other bHLH proteins, thereby preventing 
them from binding to DNA and inhibiting transcription of 
differentiation-associated genes. 10 " 12 Id gene expres- 
sion is down-regulated on differentiation in many cell 
types in vitro and in vivo. 13-18 In addition, Id proteins seem 
to be required for cell cycle progression through G^S 
phase in certain cell types, and interaction between Id-2 
and pRB is associated with enhanced proliferation in 
some cell lines in vitro? 9 ' 23 

Pancreatic cancer is the fifth leading cause of cancer 
death in the United States, with a mortality rate that vir- 
tually equals its incidence rate. 24 This malignancy is often 
associated with the overexpression of a variety of mito- 
genic growth factors and their receptors, and by onco- 
genic mutations of K-ras and inactivation of the p53 tumor 
suppressor gene. 25 We have recently reported that pan- 
creatic cancers overexpress the HLH protein Id-2, and 
that enhanced expression of this protein is evident in the 
cytoplasm of the cancer cells within the pancreatic tumor 
mass. 26 It is not known, however, whether the expression 
of other Id proteins is altered in this malignancy, or 
whether their expression is altered in chronic pancreatitis 
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(CP), an inflammatory disease that is characterized by 
dysplastic ducts, foci of proliferating ductal cells, acinar 
cell degeneration, and fibrosis. 27 We now report that 
there is a five- to sixfold increase in ld-1 and ld-2 mRNA 
levels and a twofold increase in ld-3 mRNA levels in 
pancreatic cancer by comparison with the normal pan- 
creas. In contrast, overall Id mRNA levels are not in- 
creased in CP. 



Normal 



Cancer 



f# ft #*9lji*«Me| ld-2 



Patients and Methods 

Normal human pancreatic tissue samples from 7 male 
and 5 female donors (median age 41.8 years, range 
14-68 years), CP tissues from 13 males and 1 female 
(median age 42.1 years; range 30-56 years), and pan- 
creatic cancer tissues from 10 male and 6 female donors 
(median age 62.6 years; range 53-83 years) were ob- 
tained through an organ donor program and from surgi- 
cal specimens from patients with severe symptomatic 
chronic pancreatitis or pancreatic cancer. A partial 
duodenopancreatectomy (Whipple/pylorus-preserving 
Whipple; n = 13), a left resection of the pancreas (n = 2), 
or a total pancreatectomy (n = 1 ) were carried out in the 
pancreatic cancer patients. According to the TNM clas- 
sification of the Union Internationale Contre le Cancer 
(UICC) 6 tumors were stage 1 , 1 was stage 2, and 9 were 
stage 3 ductal cell adenocarcinoma. Freshly removed 
tissue samples were fixed in 10% formaldehyde solution 
for 12 to 24 hours and paraffin-embedded for histological 
analysis. In addition, tissue samples were frozen in liquid 
nitrogen immediately on surgical removal and maintained in 
-80°C until use for RNA extraction. All studies were ap- 
proved by the Ethics Committee of the University of Bern, 
Bern, Switzerland, and by the Human Subjects Committee 
at the University of California, Irvine, California. 



Northern Blot Analysis 

Northern blot analysis was carried out as described pre- 
viously 26,28 Briefly, total RNA was extracted by the single 
step acid guanidinium thiocyanate phenol chloroform 
method. RNA was size-fractionated on 1 .2% agarose/1 .8 
mol/L formaldehyde gels, electrotransferred onto nylon 
membranes, and cross-linked by UV irradiation. Blots 
were prehybridized and hybridized with cDNA probes 
and washed under high stringency conditions. The fol- 
lowing cDNA probes were used: a 979-bp human ld-1 
cDNA probe, a 440-bp human ld-2 cDNA probe, and a 
450-bp human ld-3 cDNA probe, covering the entire 
coding regions of ld-1, ld-2, and ld-3, respectively. A 
eamHI 190-bp fragment of mouse 7S cDNA that hybrid- 
izes with human cytoplasmic RNA was used to confirm 
equal RNA loading and transfer. Blots were then exposed 
at -80°C to Kodak BioMax-MS films and the resulting 
autoradiographs were scanned to quantify the intensity of 
the radiographic bands. 26 28 For each sample the ratio of 
Id mRNA expression to 7S expression was calculated. To 
compare the relative increase in expression of the re- 
spective Id mRNA species in the cancer and CP sam- 
ples, the same normal samples were used for normal/ 
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Figure 1. mRNA expression of ld-1, ld-2, and ld-3 in pancreatic cancer and 
chronic pancreatitis. Total RNA (20 /ig/lane) from six normal, eight cancer- 
ous, and seven chronic pancreatitis tissue samples were subjected to North- 
ern blot analysis using 32 P-labeled cDNA probes (500,000 cpm/ml) specific 
for ld-1, ld-2, and ld-3, respectively. A 7S cDNA probe (50,000 cpm/ml) was 
used as a loading and transfer control. Exposure times of the normal/cancer 
blots were 1 day for all Id probes, and 2 days for the normal/CP blots. 
Exposure time was 4 hours for mouse 7S cDNA. By comparison with the 
normal samples, ld-1 and ld-3 mRNA levels were elevated in 8 and 9 cancer 
samples, respectively, whereas ld-2 was elevated in 6 cancer samples. 



cancer and normal/CP membranes. The median score for 
ld-1 , ld-2, and ld-3 mRNA levels in these normal samples 
was set to 100. Statistical analysis was performed with 
SigmaStat software (Jandel Scientific, San Raphael, CA). 
The rank sum test was used, and P < 0.05 was taken as 
the level of significance. 

Cell Culture and Western Blot Analysis 

PANC-1, MIA-PaCa-2, ASPC-1, and CAPAN-1 human 
pancreatic cell lines were obtained from ATCC (Manas- 
sas, VA). COLO-357 human pancreatic cells were a gift 
from Dr. R. S. Metzger (Durham, NC). Cells were routinely 
grown in DMEM (COLO-357, MIA-PaCa-2, PANC-1) or 
RPMI (ASPC-1, CAPAN-1) supplemented with 10% fetal 
bovine serum, 100 U/ml penicillin, and 100 jig/ml strep- 
tomycin. For immunoblot analysis, exponentially growing 



Id Proteins in Pancreatic Diseases 817 

AJP September 1999, Vol. 155, No. 3 



Hi 

a 

> 
S3 



o 
o 



1600 
1400 
1200 
1000 
800 
600 





a 


ld-1 


D 

n 








I 


o 


a a 







1200 



800 



400 



200 



ld-2 


m 

a 

a -T- 
D a 1 


°b Ac 


a 


0 


□ 

cP □ 

D D 



1 2Kb-* 




14 kDa- 



i mRNA 



ld-1 

Protein 



L6kb-+ 
1.2kb-» 



18 kDo^ 
IdkDa^ 



If IT* 



mRNA 



ld-2 



Protein 



500 - 


ld-3 


a 




400 - 


o 








□ 


1.3 kb+ 


300 - 




200 - 








100 - 






14kDa-» 


0 - 


8.8 







Normal CP Cancer 

Figure 2. Densitometry analysis of Northern blots. Autoradiographs of 
Northern blots from 12 normal, 14 CP, and 16 pancreatic cancers were 
analyzed by densitometry. mRNA levels were determined by calculating the 
ratio of the optical density for the respective Id mRNA species in relation to 
the optical density of mouse 7S cDNA. To compare the relative increase in 
expression of the respective Id mRNA species in the cancer and CP samples, 
the same normal samples were used for normal/cancer and normal/CP 
membranes. Normal pancreatic tissues are indicated by circles, CP tissues by 
triangles, and cancer tissues by squares. Data are expressed as median 
scores ± SD. By comparison with the normal samples, only the cancer 
samples exhibited significant increases: 6.5-fold (P< 0.01) for ld-1, fivefold 
(P < 0.01) for ld-2, and twofold (P - 0.027) for Id-3. 



cells (60-70% confluent) were solubilized in lysis buffer 
containing 50 mmol/L Tris-HCI, pH 7.4, 150 mmol/L NaCI, 
1 mmol/L EDTA, 1 ^g/ml pepstatin A, 1 mmol/L phenyl- 
methylsulfonyl fluoride (PMSF), and 1% Triton X-100. Pro- 
teins were subjected to sodium dodecyl sulfate polyacryl- 
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Figure 3. Id mRNA and protein expression in pancreatic cancer cell lines 
Upper panels: Total RNA (20 Mg/lane) from 5 pancreatic cancer cell lines 
were subjected to Northern blot analysis using 32 P-labeled cDNA probes 
(500,000 cpm/ml) specific for ld-1, ld-2, and ld-3, respectively. Exposure 
times were 1 day for all Id probes. Lower panels: Immunoblotting. Cell 
lysates (30 fig/lane) were subjected to SDS-PAGE. Membranes were probed 
with specific ld-1, ld-2, and ld-3 antibodies. Visualization was performed by 
enhanced chemiluminescence. 

amide gel electrophoresis (SDS-PAGE), transferred to 
Immobilon P membranes, and incubated for 90 minutes 
with the indicated antibodies and for 60 minutes with 
secondary antibodies against rabbit IgG. Visualization 
was performed by enhanced chemiluminescence. 

Immunohistochemistry 

Specific rabbit anti-human ld-1 (C-20), ld-2 (C-20), and 
ld-3 (C-20; all from Santa Cruz Biotechnology, Santa 
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Figure 4. Normal and cancerous pancreatic tissues were subjected to immu- 
nostaining using highly specific anti-Id- 1 (A-C), anti-Id-2 (D-F), and anti-Id-3 
(G-l) antibodies as described in the Methods section. Moderate to strong Id-1 
immunoreactivity was present in the cytoplasm of duct-like cancer cells (A 
and C, left panel). In the normal pancreas there was weak Id-1 immunore- 
activity in the ductal cells (B). Preabsorption with the Id- 1 -specific blocking 
peptide abolished the Id-1 immunoreactivity (C, right panel). Strong Id-2 
immunoreactivity was observed in the cytoplasm of the cancer cells that 
exhibited duct-like structures (D and F, left panel), whereas in the normal 
pancreas, there was only weak Id-2 immunoreactivity in the ductal cells (E). 
Preabsorption with the Id-2-specific blocking peptide abolished the Id-2 
immunoreactivity (F, right panel). Moderate to strong Id-3 immunoreactivity 
was present in the duct-like cancer cells (G and I, left panel). Moderate to 
strong Id-3 immunoreactivity was also present in the ductal cells of normal 
pancreatic tissue samples (H). Id-3 immunoreactivity was completely abol- 
ished by preabsorption with the Id-3 specific blocking peptide (I, right 
panel). A, D, and G constitute serial sections of a pancreatic cancer sample, 
revealing coexpression of the three Id proteins. Scale bars, 25 |im, 

Cruz, CA) polyclonal antibodies were used for immunhis- 
tochemistry. These affinity-purified rabbit polyclonal anti- 
bodies specifically react with Id-1 , Id-2, and Id-3, respec- 
tively, of human origin, as determined by Western 
blotting. Paraffin-embedded sections (4 ^m) were sub- 
jected to immunostaining using the streptavidin-peroxi- 
dase technique. Where indicated, immunostaining for all 
three Id proteins was performed on serial sections. En- 



dogenous peroxidase activity was blocked by incubation 
for 30 minutes with 0.3% hydrogen peroxide in methanol. 
Tissue sections were incubated for 15 minutes (23°C) 
with 10% normal goat serum and then incubated for 16 
hours at 4°C with the indicated antibodies in PBS con- 
taining 1% bovine serum albumin. Bound antibodies 
were detected with biotinylated goat anti-rabbit IgG sec- 
ondary antibodies and streptavidin-peroxidase complex, 
using diaminobenzidine tetrahydrochloride as the sub- 
strate. Sections were counterstained with Mayer's hema- 
toxylin. Preabsorption with Id-1-, Id-2-, or ld-3-specific 
blocking peptides completely abolished immunoreactiv- 
ity of the respective primary antibody. The immunohisto- 
chemical results were semiquantitatively analyzed as de- 
scribed previously. 29,30 The percentage of positive 
cancer cells was stratified into four groups: 0, no cancer 
cells exhibiting immunoreactivity; 1, <33% of the cancer 
cells exhibiting immunoreactivity; 2, 33 to 67% of the 
cancer cells exhibiting immunoreactivity; 3 >67% of the 
cancer cells exhibiting immunoreactivity. The intensity of 
the immunohistochemical signal was also stratified into 
four groups: 0, no immunoreactivity; 1, weak immunore- 
activity; 2, moderate immunoreactivity; 3, strong immu- 
noreactivity. Finally, the sum of the results of the cell 
score and the intensity score was calculated. Statistical 
analysis was performed with SigmaStat software. The 
rank sum test was used, and P < 0.05 was taken as the 
level of significance. 



Results 

Northern blot analysis of total RNA isolated from 12 nor- 
mal pancreatic tissues and 16 pancreatic cancers re- 
vealed the presence of the 1.2-kb Id-1 transcript and the 
1.6-kb Id2 mRNA transcript in 11 of the 12 normal pan- 
creatic samples, and the 1.3-kb Id-3 mRNA transcript in 
all normal pancreatic samples (Figure 1A, 2). In the can- 
cer tissues, Id-1 mRNA levels were elevated in 8 of 16 
samples, Id-2 mRNA levels were elevated in 9 of these 
samples, and Id-3 mRNA levels were elevated in 6 of 
these samples (Figure 1A, 2). Concomitant overexpres- 
sion of all three Id species was observed in 6 of the 
cancer samples (38%). In contrast, none of the Id mRNA 
species were overexpressed in CP by comparison with 
normal controls (Figure 1B, 2). Densitometric analysis of 
all of the autoradiograms indicated that there was a 6.5- 
fold increase (P < 0.01) in Id-1 mRNA levels, a fivefold 
increase (P < 0.01) in Id-2 mRNA levels, and a twofold 
increase (P = 0.027) in Id-3 mRNA levels in the pancre- 
atic cancer samples in comparison to normal controls 
(Figure 2). In contrast, there was no statistically signifi- 
cant difference in the expression levels of ld-1 , Id-2, and 
Id-3, in CP tissues in comparison to the corresponding 
levels in the normal pancreas (Figure 2). 

Next, we assessed the expression of the three Id 
genes in 5 human pancreatic cancer cell lines by North- 
ern and Western blot analyses. Id-1 mRNA was present 
at varying levels in all 5 cell lines (Figure 3). ASPC-1, 
CAPAN-1, MIA-PaCa-2, and PANC-1 expressed moder- 
ate to high levels of Id-1 mRNA, whereas COLO-357 cells 
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Figure 5. Inimunohistochemistry of pancreatic cancer and dysplastic ducts in CP tissues. In the pancreatic cancer tissues (A-C) there was moderate to strong Id-1 
(A) Id-2 (B) and Id-3 (C) immunoreactivity in the ductal cells in the areas adjacent to the cancer cells that exhibited CP-like alterations. Islet cells d>d not exhibit 
Id immunoreactivity (outlined by solid arrowheads). In the CP samples, moderate to strong Id-1 (D), Id-2 (E), and Id-3 (F) immunoreactivity was present in the 
cytoplasm of epithelial cells forming large dysplastic ducts. Scale bar, 25 fim. 



expressed relatively low levels of this mRNA moiety. 
Western blotting with a highly specific anti-ld-1 antibody 
confirmed the presence of the approximately 14-kd Id-1 
protein in the 4 cell lines that expressed high levels of 
Id-1 mRNA (Figure 3). Furthermore, the three cell lines 
with the highest Id-1 mRNA expression (CAPAN-1, MIA- 
PaCa-2, and PANC-1) also exhibited the highest Id-1 
protein expression. Variable levels of the 1.6-kb Id-2 
mRNA transcript were present in all 5 cell lines. In addi- 
tion, a minor band of approximately 1.2 kb was visible in 
COLO-357 and MIA-PaCa-2 cells. Immunoblot analysis 
with a highly specific anti-ld-2 antibody revealed two 
bands of approximately 16 and 18 kd at relatively high 
levels in all of the cell lines with exception of PANC-1 
cells, in which the 16-kd band was relatively faint (Figure 
3). With the exception of MIA-PaCa-2 cells, there was a 
good correlation between Id-2 mRNA and protein levels 
(Figure 3). Id-3 mRNA was present at high levels in 
MIA-PaCa-2 cells, at moderate levels in COLO-357 cells, 
and at low levels in PANC-1 cells. Id-3 mRNA was not 
detectable in ASPC-1 and CAPAN-1 cells (Figure 3). 
Immunoblot analysis with a highly specific anti-ld-3 anti- 
body revealed an approximately 14-kd band that was most 
abundant in MIA-PaCa-2 cells, and was also readily appar- 
ent in COLO-357 and PANC-1 cells. In contrast, only a faint 
Id-3 band was seen in ASPC-1 and CAPAN-1 cells. Thus, 
with the exception of PANC-1 cells, there was a good cor- 
relation between Id-3 mRNA and protein levels. 



To determine the localization of Id-1, Id-2, and Id-3, 
immunostaining was carried out using the same highly 
specific anti-Id antibodies. In the pancreatic cancers, 
moderate to strong Id-1 immunoreactivity was present in 
the cancer cells in 9 of 10 randomly selected cancer 
samples. An example of moderate Id-1 immunoreactivity 
is shown in Figure 4A, and of strong immunoreactivity in 
Figure 4C (left panel). In contrast, in the normal pancreas, 
faint Id-1 immunoreactivity was present only in the ductal 
cells of pancreatic ducts (Figure 4B, arrowheads). Pre- 
absorption with the Id-1 -specific blocking peptide com- 
pletely abolished the Id-1 immunoreactivity (Figure 4C, 
right panel). The cancer cells also exhibited strong Id-2 
(Figure 4, D and F, left panel) and moderate to strong Id-3 
immunoreactivity. An example of moderate Id-3 immuno- 
reactivity is shown in Figure 4G, and of strong immuno- 
reactivity in Figure 41 (left panel). In contrast, only faint 
Id-2 immunoreactivity was present in the ductal cells in 
the normal pancreas (Figure 4E), whereas Id-3 immuno- 
reactivity in these cells was more variable and ranged 
from moderate to occasionally strong (Figure 4H). Islet 
cells and acinar cells were always devoid of Id immunore- 
activity. Preabsorption of the respective antibody with the 
blocking peptides specific for Id-2 (Figure 4F, right panel) 
and Id-3 (Figure 41, right panel) completely abolished im- 
munoreactivity. Analysis of serial pancreatic cancer sec- 
tions revealed that there was often colocalization of the 
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Figure 6. Immunohistochemistry of atypical papillary epithelium in CP tissues. Serial section analysis of some CP samples revealed the presence of large duct-like 
structures with atypical papillary epithelium. Mild to moderate Id-1 (A) and Id-2 (B) immunoreactivity and weak Id-3 (C) immunoreactivity was present in the 
cytoplasm of the cells forming these large ducts with papillary structures. Some CP samples also exhibited moderate Id-3 immunoreactivity in these cells (D). Scale 
bar, 25 /im. 



three Id proteins. An example of serial sections from a 
pancreatic cancer tissue is shown in Figure 4, A, D, and G. 

Id-1, ld-2, and Id-3 immunoreactivity was also present 
at moderate levels in the cytoplasm of ductal cells within 
CP-like areas adjacent to the cancer cells (Figure 5, A-C). 
As in the normal pancreas, islet cells (outlined by arrow- 
heads) did not exhibit Id immunoreactivity. In 4 of 9 CP 
samples, there were foci of ductal cell dysplasia of rela- 
tively large interlobular ducts, all of which exhibited mod- 
erate to strong Id-1, ld-2, and Id-3 immunoreactivity (Fig- 
ure 5, D-F). Five of 9 CP samples also contained foci of 
large ducts exhibiting atypical papillary epithelium. Serial 
section analysis of one of those CP samples revealed 
mild to moderate Id-1 and ld-2 immunoreactivity and 
weak Id-3 immunoreactivity in the cells of these atypical 
papillary ducts (Figure 6, A-C). In contrast, in some of 
these CP samples, moderate to strong Id-3 immunoreac- 
tivity was also observed (Figure 6D). However, most of 
the ductal cells forming the typical ductular structures of 
CP, such as large interlobular ducts and small proliferat- 
ing ducts, exhibited generally only weak to occasionally 
moderate Id immunoreactivity (data not shown). 



The immunohistochemical data for Id-1, ld-2, and Id-3 
are summarized in Table 1. In the case of Id-1 and ld-2, 
the cancer cells as well as the dysplastic and atypical 
papillary ducts in CP exhibited a significantly higher 
score than the ductal cells in the normal pancreas. In 
contrast, due to the marked variability in Id-3 immuno- 
staining in the normal pancreas, the differences between 
normal and cancer cells and normal and dysplastic cells 
did not achieve statistical significance. 



Discussion 

Id proteins constitute a family of HLH transcription factors 
that are important regulators of cellular differentiation and 
proliferation. 1,2 To date, four members of the human Id 
family have been identified. 1,10 " 12 Their expression is 
enhanced during cellular proliferation and in response to 
mitogenic stimuli, 19,31 and overexpression of Id genes 
inhibits differentiation and/or enhances proliferation in 
several different cell types. 15,32 " 34 The forced expression 
of Id-1 in mouse small intestinal epithelium results in 
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Table 1. Histological Scoring 









ld-1 


ld-2 


ld-3 


Normal (n = 
Cancer (n = 
CP (n = 9) 


6) 
10) 


Ductal cells 
Cancer cells 

Typical CP lesions (n = 9) 
Dysplastic ducts (n = 4) 
Atypical papillary ducts (n = 5) 


2.0 ± 0.4 
4.5* ± 0.5 

2.7 ± 0.5 
5.3+ ± 0.2 
4.4* ± 0.2 


2.3 ± 0.2 
5.2 § ± 0.3 

3.1 ±0.6 
5.8* ± 0.2 
5.2* ± 0.2 


2.5 ± 0.9 
4.5 ± 0.6 
3.4 ± 0.7 
5.3 ± 0.4 
5.0 ± 0.4 



Scoring of the histological specimens was performed as described in the Patients and Methods section. Values are the means ± SD of the number 
of samples indicated in parenthesis. P values are based on comparisons with the respective controls in the normal samples. 
\P< 0.02; f P < 0.01; *P = 0.004; § P = 0.001. 



adenoma formation in these animals 35 The growth-pro- 
moting effects of Id genes are thought to occur through 
several mechanisms. For example, ld-2 can bind to mem- 
bers of the pRB tumor suppressor family, thus blocking 
their growth-suppressing activity, 20,21 and ld-1 and ld-2 
can antagonize the bHLH-mediated activation of known 
inhibitors of cell cycle progression such as the cyclin- 
dependent kinase inhibitor p21. 23 

In the present study, we determined by Northern blot 
analysis that a significant percentage of human pancre- 
atic cancers expressed increased ld-1, ld-2, and ld-3 
mRNA levels. Increased expression was most evident for 
ld-1 (6.5-fold) and ld-2 (fivefold). In contrast, ld-3 mRNA 
levels were only twofold increased in the cancer samples, 
partly because this mRNA was present at relatively high 
levels in the normal pancreas. Immunhistochemical anal- 
ysis confirmed the presence of ld-1, ld-2, and ld-3 in the 
cancer cells within the tumor mass, whereas in the normal 
pancreas faint ld-1 and ld-2 immunoreactivity and mod- 
erate to occasionally strong ld-3 immunoreactivity was 
present in some ductal cells. Pancreatic acinar and islet 
cells in the normal pancreas were devoid of ld-1, ld-2, 
and ld-3 immunoreactivity. In the cancer samples, all 
three Id proteins often colocalized in the cancer cells. 
Coexpression of all three Id genes was also observed in 
cultured pancreatic cancer cell lines, which often exhib- 
ited a close correlation between Id mRNA and protein 
expression. However, in MIA-PaCa-2 there was a diver- 
gence of ld-2 mRNA and protein levels, and in PANC-1 
cells, ld-3 mRNA levels did not correlate well with ld-3 
protein expression. These observations suggest that in 
these cells, the half-life of either Id mRNA or Id protein 
may be altered by comparison with the other cell lines. 
Interestingly, ld-2 immunoblotting revealed two closely 
spaced bands of approximately 16 and 18 kd in 4 of 5 
cell lines. In view of the fact that two possible initiation 
codons have been reported for the ld-2 gene, 36 our 
observation raises the possibility that the two ld-2-immu- 
noreactive bands may represent separate translation 
products of the ld-2 gene. 

Pancreatic cancers often harbor p53 tumor suppressor 
gene mutations 37 and exhibit alterations in apoptosis 
pathways. Thus ; these cancers often exhibit increased 
expression of anti-apoptotic proteins such as Bcl-2 38 and 
abnormal resistance to Fas-ligand-mediated apopto- 
sis 39 It has been shown recently that forced constitutive 
expression of Id genes together with the expression of 
anti-apoptotic genes such as Bcl-2 or BclX L can result in 



malignant transformation of human fibroblasts, 11 raising 
the possibility that the enhanced Id expression in pan- 
creatic cancers together with increased expression of 
anti-apoptotic genes may contribute to the malignant 
potential of pancreatic cancer cells in vivo. 

In the CP tissues there was no significant increase in 
ld-1, ld-2, and ld-3 mRNA levels in comparison to the 
normal pancreas. Immunohistochemical analysis of pan- 
creatic cancer samples revealed colocalization of weak 
to moderate ld-1, ld-2, and ld-3 immunoreactivity in pro- 
liferating ductal cells in the CP-like regions adjacent to 
the cancer cells, indicating that Id expression was not 
restricted to the cancer cells. Similarly, analysis of CP 
samples indicated weak ld-1, ld-2, and ld-3 immunore- 
activity in the cells of small proliferating ducts and large 
ducts without dysplastic changes. In general, there was a 
correlation between weak immunoreactivity and low Id 
mRNA levels. However, in samples that harbored large 
ducts with papillary structures there was moderate Id 
immunoreactivity, and in the cells forming dysplastic 
ducts there was moderate to strong Id immunoreactivity. 
In these CP samples, Id mRNA levels were relatively 
higher than in the CP samples that were devoid of these 
histological changes. Overall, however, increased Id ex- 
pression, most notably of ld-1 and ld-2, distinguished a 
subgroup of pancreatic cancers from CP (Table 1). 

Epidemiological studies have shown that the risk of 
developing pancreatic cancer is increased up to 16-fold 
in patients with pre-existing CP in comparison to the 
general population. 40 The mechanisms that contribute to 
neoplastic transformation in CP are not known. Although 
there is no established tumor progression model for pan- 
creatic cancer, such as the adenoma-carcinoma se- 
quence of colorectal carcinoma, 41 it is generally ac- 
cepted that K-ras and p16 mutations occur relatively 
early in pancreatic carcinogenesis, whereas p53 muta- 
tions occur late in this process 37,41-43 Increased Id ex- 
pression may contribute to malignant transformation of 
cultured cell lines in vitro" and has been linked to cell 
invasion in a murine mammary epithelial cell line 44 In 
view of the current findings that ld-1, ld-2, and ld-3 are 
overexpressed in pancreatic cancer and in dysplastic/ 
metaplastic ducts in CP, these observations raise the 
possibility that elevated levels of ld-1, ld-2, and, to a 
lesser extent, ld-3 may represent relatively early markers 
of pancreatic malignant transformation and may contrib- 
ute to the pathobiology of pancreatic cancer. 
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In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis and gathered quan- 
titative information from about 1,400 spots. We found that there is an enormous range of protein abundance 
and, for identified spots, a good correlation between protein abundance, mRNA abundance, and codon bias. 
For each molecule of well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover was examined and found 
to be insignificant for abundant proteins. Some phosphoproteins were identified. The behavior of proteins in 
differential centrifugation experiments was examined. Such experiments with 2D gels can give a global view of 
the yeast proteome. 



The sequence of the yeast genome has been determined (9). 
More recently, the number of mRNA molecules for each ex- 
pressed gene has been measured (27, 30). The next logical level 
of analysis is that of the expressed set of proteins. We have 
begun to analyze the yeast proteome by using two-dimensional 
(2D) gels. 

2D gel electrophoresis separates proteins according to iso- 
electric point in one dimension and molecular weight in the 
other dimension (21), allowing resolution of thousands of pro- 
teins on a single gel. Although modern imaging and computing 
techniques can extract quantitative data for each of the spots in 
a 2D gel, there are only a few cases in which quantitative data 
have been gathered from 2D gels. 2D gel electrophoresis is 
almost unique in its ability to examine biological responses 
over thousands of proteins simultaneously and should there- 
fore allow us a relatively comprehensive view of cellular me- 
tabolism. 

We and others have worked toward assembling a yeast pro- 
tein database consisting of a collection of identified spots in 2D 
gels and of data on each of these spots under various condi- 
tions (2, 7, 8, 10, 23, 25). These data could then be used in 
analyzing a protein or a metabolic process. Saccharomyces 
cerevisiae is a good organism for this approach since it has a 
well-understood physiology as well as a large number of mu- 
tants, and its genome has been sequenced. Given the sequence 
and the relative lack of introns in S. cerevisiae, it is easy to 
predict the sequence of the primary protein product of most 
genes. This aids tremendously in identifying these proteins on 
2D gels. 

There are three pillars on which such a database rests: (i) 
visualization of many protein spots simultaneously, (ii) quan- 
tification of the protein in each spot, and (iii) identification of 
the gene product for each spot. Our first efforts at visualization 
and identification for S. cerevisiae have been described else- 
where (7, 8). Here we describe quantitative data for these 
proteins under a variety of experimental conditions. 

MATERIALS AND METHODS 
Strains and media. S. cerevisiae W303 (MATa ade2-l his3-ll,15 leu2~3, 112 
trpl-1 ura3-l canl-100) was used (26). - Met YNB (yeast nitrogen base) medium 
was 1.7 g of YNB (Difco) per liter, 5 g of ammonium sulfate per liter, and 
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adenine, uracil, and all amino acids except methionine; -Met -Cys YNB me- 
dium was the same but without methionine or cysteine. Medium was supple- 
mented with 2% glucose (for most experiments) or with 2% ethanol (for ethanol 
experiments). Low-phosphate YEPD was described by Warner (28). 

Isotopic labeling of yeast and preparation of cell extracts. Yeast strains were 
labeled and proteins were extracted as described by Garrels et al. (7, 8). Briefly, 
cells were grown to 5 X 10 6 cells per ml. at 30°C; 1 ml of culture was transferred 
to a fresh tube, and 0.3 mCi of [ 35 Slmethionine (e.g., Express protein labeling 
mix; New England Nuclear) was added to this 1-ml culture. The cells were 
incubated for a further 10 to 15 min and then transferred to a 1.5-ml microcen- 
trifuge tube, chilled on ice, and harvested by centrifugation. The supernatant was 
removed, and the cell pellet was resuspended in 100 p.1 of lysis buffer (20 mM 
Tris-HCl [pH 7.6], 10 mM NaF, 10 mM sodium pyrophosphate, 0.5 mM EDTA, 
0.1% deoxycholate; just before use, phenylmethylsulfonyl fluoride was added to 
1 mM, leupeptin was added to 1 u.&'ml, pepstatin was added to 1 \x.gfm\, tosyl- 
sulfonyl phenylalanyl chloromethyl ketone was added to 10 u-g/ml, and soybean 
trypsin inhibitor was added to 10 u.g/ml). 

The resuspended cells were transferred to a screw-cap 1.5-ml polypropylene 
tube containing 0.28 g of glass beads (0.5-mm diameter; Biospec Products) or 
0.40 g of zirconia beads (0.5-mm diameter; Biospec Products). After the cap was 
secured, the tube was inserted into a MiniBeadbeater 8 (Biospec Products) and 
shaken at medium high speed at 4°C for 1 min. Breakage was typically 75%. 
Tubes were then spun in a microcentrifuge for 10 s at 5,000 X g at 4°C. 

With a very fine pipette tip, liquid was withdrawn from the beads and trans- 
ferred to a prechilled 1.5-ml tube containing 7 u.1 of DNase I (0.5 mg/ml; Cooper 
product no. 6330)-RNase A (0.25 mg/ml; Cooper product no. 5679 )-Mg (50 mM 
MgCl 2 ) mix. Typically 70 uJ of liquid was recovered. The mixture was incubated 
on ice for 10 min to allow the RNase and DNase to work. 

Next, 75 jxl of 2x dSDS (2x dSDS is 0.6% sodium dodecyl sulfate [SDS], 2% 
mercaptoethanol, and 0.1 M Tris-HCl [pH 81) was added. The tube was plunged 
into boiling water, incubated for 1 min, and then plunged into ice. After cooling, 
the tube was centrifuged at 4°C for 3 min at 14,000 x g. The supernatant was 
transferred to a fresh tube and frozen at -70°C About 5 uJ of this supernatant 
was used for each 2D gel. 

2D polyacrylamide gels. 2D gels were made and run as described elsewhere 
(6-8). 

Image analysis of the gels. The Quest II software system was used for quan- 
titative image analysis (20, 22). Two techniques were used to collect quantitative 
data for analysis by Quest II software. First, before the advent of phosphorim- 
agers, gels were dried and fluorographed. Each gel was exposed to film for three 
different times (typically 1 day, 2 weeks, and 6 weeks) to increase the dynamic 
range of the data. The films were scanned along with calibration strips to relate 
film optical density to disintegrations per minute in the gels and analyzed by the 
software to obtain a linear relationship between disintegrations per minute in the 
spots and optical densities of the film images. The quantitative data are ex- 
pressed as parts per million of the total cellular protein. This value is calculated 
from the disintegrations per minute of the sample loaded onto the gel and by 
comparing the film density of each data spot with density of the film over the 
calibration strips of known radioactivity exposed to the same film. This yields the 
disintegrations per minute per millimeter for each spot on the gel and thence its 
parts-per-minute value. 

After the advent of phosphorimaging, gels bearing 35 S-labeled proteins were 
exposed to phosphorimager screens and scanned by a Fuji phosphorimager, 
typically for two exposures per gel. Calibration strips of known radioactivity were 
exposed simultaneously. Scan data from the phosphorimager was assimilated by 
Quest II software, and quantitative data were recorded for the spots on the gels. 
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Measurements of protein turnover. Cells in exponential phase were pulse- 
labeled with [ 35 S]methionine, excess cold Met and Cys were added, and samples 
of equal volume were taken from the culture at intervals up to 90 min (in one 
experiment) or up to 160 min (in a second experiment). Incorporation of 35 S into 
protein was essentially 100% by the first sample (10 min). Extracts were made, 
and equal fractions of the samples were loaded on 2D gels (i.e., the different 
samples had different amounts of protein but equal amounts of 35 S). Spots were 
quantitated with a phosphorimaging and Quest software. 

The software was queried for spots whose radioactivity decreased through the 
time course. The algorithm examined all data points for all spots, drew a best-fit 
line through the data points, and looked for spots where this line had a statis- 
tically significant negative slope. In one of the experiments, there was one such 
spot. To the eye, this was a minor, unidentified spot seen only in the first two 
samples (10 and 20 min). In the other experiment, the Quest software found no 
spots meeting the criteria. Therefore, we concluded that none of the identified 
spots (and all but one of the visible spots) represented proteins with long 
half-lives. 

Centrifugal fractionation. Cells were labeled, harvested, and broken with glass 
beads by the standard method described above except that no detergent (i.e., no 
deoxycholate) was present in the lysis buffer. The crude lysate was cleared of 
unbroken cells and large debris by centrifugation at 300 x # for 30 s. The 
supernatant of this centrifugation was then spun at 16,000 x g for 10 min to give 
the pellet used for Fig. 6B. The supernatant of the 16,000 x g, 10-min spin was 
then spun at 100,000 X g for 30 min to give the supernatant used for Fig. 6A. 

Protein abundance calculations. A haploid yeast cell contains about 4X10 12 
g of protein (1, 15). Assuming a mean protein mass of 50 kDa, there are about 
50 x 10* molecules of protein per cell. There are about 1.8 methionines per 10 
kDa of protein mass, which implies 4.5 x 10 8 molecules of methionine per cell 
(neglecting the small pool of free Met). We measured (i) the counts per minute 
in each spot on the 2D gels, (ii) the total number of counts on each gel (by 
integrating counts over the entire gel), and (iii) the total number of counts 
loaded on the gel (by scintillation counting of the original sample). Thus, we 
know what fraction of the total incorporated radioactivity is present in each spot. 
After correcting for the methionine (and cysteine [see below]) content of each 
protein, we calculated an absolute number of protein molecules based on the 
fraction of radioactivity in each spot and on 50 x 10 6 total molecules per cell. 

The labeling mixture used contained about one-fifth as much radioactive 
cysteine as radioactive methionine. Therefore, the number of cysteine molecules 
per protein was also taken into account in calculating the number of molecules 
of protein, but Cys molecules were weighted one-fifth as heavily as Met mole- 
cules. 

mRNA abundance calculations. For estimation of mRNA abundance, we used 
SAGE (serial analysis of gene expression) data (27) and Affymetrix chip hybrid- 
ization data (29a, 30). The mRNA column in Table 1 shows mRNA abundance 
calculated from SAGE data alone. However, the SAGE data came from cells 
growing in YEPD medium, whereas our protein measurements were from cells 
growing in YNB medium. In addition, SAGE data for low-abundance mRNAs 
suffers from statistical variation. Therefore, we also used chip hybridization data 
(29a, 30) for mRNA from cells grown in YNB. These hybridization data also had 
disadvantages. First, the amounts of high-abundance mRNAs were systemati- 
cally underestimated, probably because of saturation in the hybridizations, which 
used 10 u,g of cRNA. For example, the abundance of ADH1 mRNA was 197 
copies per cell by SAGE but only 32 copies per cell by hybridization, and the 
abundance of EN02 mRNA was 248 copies per cell by SAGE but only 41 by 
hybridization. When the amount of cRNA used in the hybridization was reduced 
to 1 u-g, the apparent amounts of mRNA were similar to the amounts determined 
by SAGE (29a, 29b). However, experiments using 1 u.g of cRNA have been done 
for only some genes (29a). Because amounts of mRNA were normalized to 
15,000 per cell, and because the amounts of abundant mRNAs were underesti- 
mated, there is a 2.2-fold overestimate of the abundance of nonabundant 
mRNAs. We calculated this factor of 2.2 by adding together the number of 
mRNA molecules from a large number of genes expressed at a low level for both 
SAGE data and hybridization data. The sum for the same genes from hybrid- 
ization data is 2.2-fold greater than that from SAGE data. 

To take into account these difficulties, we compiled a list of "adjusted" mRNA 
abundance as follows. For all high-abundance mRNAs of our identified proteins, 
we used SAGE data. For all of these particular mRNAs, chip hybridization 
suggested that mRNA abundance was the same in YEPD and YNB media. For 
medium-abundance mRNAs, SAGE data were used, but when hybridization 
data showed a significant difference between YEPD and YNB, then the SAGE 
data were adjusted by the appropriate factor. Finally, for low-abundance 
mRNAs, we used data from chip hybridizations from YNB medium but divided 
by 2.2 to normalize to the SAGE results. These calculations were completed 
without reference to protein abundance. 

CAI. The codon adaptation index (CAI) was taken from the yeast proteome 
database (YPD) (13), for which calculations were made according to Sharp and 
Li (24). Briefly, the index uses a reference set of highly expressed genes to assign 
a value to each codon, and then a score for a gene is calculated from the 
frequency of use of the various codons in that gene (24). 

Statistical analysis* The JMP program was used with the aid of T. Tully. The 
JMP program showed that neither mRNA nor protein abundances were nor- 
mally distributed; therefore, Spearman rank correlation coefficients (r s ) were 



calculated. The mRNA (adjusted and unadjusted) and protein data were also 
transformed so that Pearson product-moment correlation coefficients (r p ) could 
be calculated. First, this was done by a Box-Cox transformation of log-trans- 
formed data. This transformation produced normal distributions, and an r p of 
0.76 was achieved. However, because the Box-Cox transformation is complex, we 
also did a simpler logarithmic transformation. This produced a normal distribu- 
tion for the protein data. However, the distribution for the mRNA and adjusted 
mRNA data was close to, but not quite, normal. Nevertheless, we calculated the 
r p and found that it was 0.76, identical to the coefficient from the Box-Cox 
transformed data. We therefore believe that this correlation coefficient is not 
misleading, despite the fact that the log(mRNA) distribution is not quite normal. 



RESULTS 

Visualization of 1,400 spots on three gel systems. Yeast 
proteins have isoelectric points ranging from 3.1 to 12.8, and 
masses ranging from less than 10 kDa to 470 kDa. It is difficult 
to examine all proteins on a single kind of gel, because a gel 
with the needed range in pi and mass would give poor resolu- 
tion of the thousands of spots in the central region of the gel. 
Therefore, we have used three gel systems: (i) pH "4 to 8" with 
10% polyacrylamide; (ii) pH "3 to 10" with 10% polyacryl- 
amide; and (iii) nonequilibrium with 15% polyacrylamide (7, 
8). Each gel system allows good resolution of a subset of yeast 
proteins. 

Figure 1 shows a pH 4-8, 10% polyacrylamide gel. The pH 
at the basic end of the isoelectric focusing gel cannot be main- 
tained throughout focusing, and so the proteins resolved on 
such gels have isoelectric points between pH 4 and pH 6.7. For 
these pH 4-8 gels, we see 600 to 900 spots on the best gels after 
multiple exposures. 

The pH 3-10 gels (not shown) extend the pi range somewhat 
beyond pH 7.5, allowing detection of several hundred addi- 
tional spots. Finally, we use nonequilibrium gels with 15% 
acrylamide in the second dimension. These allow visualization 
of about 100 very basic proteins and about 170 small proteins 
(less than 20 kDa). In total, using all three gel systems, about 
1,400 spots can be seen. These represent about 1,200 different 
proteins, which is about one-quarter to one-third of the pro- 
teins expressed under these conditions (27, 30). Here, we focus 
on the proteins seen on the pH 4-8 gels. 

Although nearly all expressed proteins are present on these 
gels, the number seen is limited by a problem we call coverage. 
Since there are thousands of proteins on each gel, many pro- 
teins comigrate or nearly comigrate. When two proteins are 
resolved, but are close together, and one protein spot is much 
more intense than the other, a problem arises in visualizing the 
weaker spot: at long exposures when the weak signal is strong 
enough for detection, the signal from the strong spot spreads 
and covers the signal from the weaker spot. Thus, weak spots 
can be seen only when they are well separated from strong 
spots. 

For a given gel, the number of detectable spots initially rises 
with exposure time. However, beyond an optimal exposure, the 
number of distinguishable spots begins to decrease, because 
signals from strong spots cover signals from nearby weak spots. 
At long exposures, the whole autoradiogram turns black. Thus, 
there is an optimum exposure yielding the maximum number 
of spots, and at this exposure the weakest spots are not seen. 

Largely because of the problem of coverage, the proteins 
seen are strongly biased toward abundant proteins. All identi- 
fied proteins have a CAI of 0.18 or more, and we have iden- 
tified no transcription factors or protein kinases, which are 
nonabundant proteins. Thus, this technology is useful for ex- 
amining protein synthesis, amino acid metabolism, and glyco- 
lysis but not for examining transcription, DNA replication, or 
the cell cycle. 
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FIG. 1. 2D gels. The horizontal axis is the isoelectric focusing dimension, which stretches from pH 6.7 (left) to pH 4.3 (right). The vertical axis is the polyacrylamide 
gel dimension, which stretches from about 15 kDa (bottom) to at least 130 kDa (top). For panel A, extract was made from cells in log phase in glucose; for panel B, 
cells were grown in ethanol. The spots labeled 1 through 6 are unidentified proteins highly induced in ethanol. 
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Spot identification. The identification of various spots has 
been described elsewhere (7, 8). At present, 169 different spots 
representing 148 proteins have been identified. Many of these 
spots have been independently identified (2, 10, 23, 25). The 
main methods used in spot identification have been analysis of 
amino acid composition, gene overexpression, peptide se- 
quencing, and mass spectrometry. 

Pulse-chase experiments and protein turnover. Pulse-chase 
experiments were done to measure protein half-lives (Materi- 
als and Methods). Cells were labeled with [ 35 S]methionine for 
10 min, and then an excess of unlabeled methionine was added. 
Samples were taken at 0, 10, 20, 30, 60, and 90 min after the 
beginning of the chase. Equal amounts of 35 S were loaded from 
each sample; 2D gels were run, and spots were quantitated. 
Surprisingly, almost every spot was nearly constant in amount 
of radioactivity over the entire time course (not shown). A few 
spots shifted from one position to another because of post- 
translational modifications (e.g., phosphorylation of RpaO and 
Efbl). Thus, the proteins being visualized are all or nearly all 
very stable proteins, with half-lives of more than 90 min. Gygi 
et al. (10) have come to a similar conclusion by using the N-end 
rule to predict protein half-lives. This result does not imply 
that all yeast proteins are stable. The proteins being visualized 
are abundant proteins; this is partly because they are stable 
proteins. 

Protein quantitation. Because all of the proteins seen had 
effectively the same half-life, the abundance of each protein 
was directly proportional to the amount of radioactivity incor- 
porated during labeling. Thus, after taking into account the 
total number of protein molecules per cell, the average content 
of methionine and cysteine, and the methionine and cysteine 
content of each identified protein, we could calculate the abun- 
dance of each identified protein (Tables 1 and 2; Materials and 
Methods). About 1,000 unidentified proteins were also quan- 
tified, assuming an average content of Met and Cys. 

Many proteins give multiple spots (7, 8). The contribution 
from each spot was summed to give the total protein amount. 
However, many proteins probably have minor spots that we are 
not aware of, causing the amount of protein to be underesti- 
mated. 

When the proteins on a pH 4-8 gel were ordered by abun- 
dance, the most abundant protein had 8,904 ppm, the 10th 
most abundant had 2,842 ppm, the 100th most abundant had 
314 ppm, the 500th most abundant had 57 ppm, and the 
1,000th most abundant (visualized at greater than optimum 
exposure) had 23 ppm. Thus, there is more than a 300-fold 
range in abundance among the visualized proteins. The most 
abundant 10 proteins account for about 25% of the total pro- 
tein on the pH 4-8 gel, the most abundant 60 proteins account 
for 50%, and the most abundant 500 proteins account for 80%. 
Since it seems likely that the pH 4-8 gels give a representative 
sampling of all proteins, we estimate that half of the total 
cellular protein is accounted for by fewer than 100 different 
gene products, principally glycolytic enzymes and proteins in- 
volved in protein synthesis. 

Correlation of protein abundance with mRNA abundance. 
Estimates of mRNA abundance for each gene have been made 
by SAGE (27) and by hybridization of cRNA to oligonucleo- 
tide arrays (30). These two methods give broadly similar re- 
sults, yet each method has strengths and weaknesses (Materials 
and Methods). Table 1 lists the number of molecules of mRNA 
per cell for each gene studied. One measurement (mRNA) 
uses data from SAGE analysis alone (27); a second incorpo- 
rates data from both SAGE and hybridization (30) (adjusted 
mRNA) (Table 1; Materials and Methods). We correlated 
protein abundance with mRNA abundance (Fig. 2). For ad- 



justed mRNA versus protein, the Spearman rank correlation 
coefficient, r s , was 0.74 (P < 0.0001), and the Pearson corre- 
lation coefficient, r p , on log transformed data (Materials and 
Methods) was 0.76 (P < 0.00001). We obtained similar corre- 
lations for mRNA versus protein and also for other data trans- 
formations (Materials and Methods). Thus, several statistical 
methods show a strong and significant correlation between 
mRNA abundance and protein abundance. Of course, the cor- 
relation is far from perfect; for mRNAs of a given abundance, 
there is at least a 10-fold range of protein abundance (Fig. 2). 
Some of this scatter is probably due to posttranscriptional 
regulation, and some is due to errors in the mRNA or protein 
data. For example, the protein Yef3 runs poorly on our gels, 
giving multiple smeared spots. Its abundance has probably 
been underestimated, partly explaining the low protein/mRNA 
ratio of Yef3. It is the most extreme outlier in Fig. 2. 

These data on mRNA (27, 30) and protein abundance (Ta- 
ble 1) suggest that for each mRNA molecule, there are on 
average 4,000 molecules of the cognate protein. For instance, 
for Actl (actin) there are about 54 molecules of mRNA per 
cell and about 205,000 molecules of protein. Assuming an 
mRNA half-life of 30 min (12) and a cell doubling time of 120 
min, this suggests that an individual molecule of mRNA might 
be translated roughly 1,000 times. These calculations are lim- 
ited to mRNAs for abundant proteins, which are likely to be 
the mRNAs that are translated best. 

A full complement of cell protein is synthesized in about 120 
min under these conditions. Thus, 4,000 molecules of protein 
per molecule of mRNA implies that translation initiates on an 
mRNA about once every 2 s. This is a remarkably high rate; it 
implies that if an average mRNA bears 10 ribosomes engaged 
in translation, then each ribosome completes translation in 
20 s; if an average protein has 450 residues; this in turn implies 
translation of over 20 amino acids per s, a rate considerably 
higher than estimated for mammalians (3 to 8 amino acids per 
s) (18). These estimates depend on the amount of mRNA per 
cell (11, 27). 

The large number of protein molecules that can be made 
from a single mRNA raises the issue of how abundance is 
controlled for less abundant proteins. Many nonabundant pro- 
teins may be unstable, and this would reduce the protein/ 
mRNA ratio. In addition, many nonabundant proteins may be 
translated at suboptimal rates. We have found that mRNAs for 
nonabundant proteins usually have suboptimal contexts for 
translational initiation. For example, there are over 600 yeast 
genes which probably have short open reading frames in the 
mRNA upstream of the main open reading frame (17a). These 
may be devices for reducing the amount of protein made from 
a molecule of mRNA. 

Correlation of codon bias with protein abundance. The 
mRNAs for highly expressed proteins preferentially use some 
codons rather than others specifying the same amino acid (14). 
This preference is called codon bias. The codons preferred are 
those for which the tRNAs are present in the greatest amounts. 
Use of these codons may make translation faster or more 
efficient and may decrease misincorporation. These effects are 
most important for the cell for abundant proteins, and so 
codon bias is most extreme for abundant proteins. The effect 
can be dramatic— highly biased mRNAs may use only 25 of the 
61 codons. 

We asked whether the correlation of codon bias with abun- 
dance continues for medium-abundance proteins. There are 
various mathematical expressions quantifying codon bias; here, 
we have used the CAI (24) (Materials and Methods) because 
it gives a result between 0 and 1. The r s for CAI versus protein 
abundance is 0.80 (P < 0.0001), similar to the mRNA-protein 
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- CAI, a measure of codon bias, is taken from the YPD. mRNA, number of mRNA molecules per cell from SAGE data (27); adjusted mRNA, number of mRNA 
molecules per cell based on both SAGE and chip hybridization (30) (see Materials and Methods); Protein (Glu) number of molecules of protein per cell in 
YNB-elucose- Protein (Eth) number of molecules of protein per cell in YNB-ethanol; E/G ratio, ratio of protein abundance in ethanol to glucose. The E/G ra tio is 

not repeatable P ( NR) in multiple gels. Some gene products (e.g., Tifl and Tif2 [Tifl 2]) were difficult to distinguish on either 
a protein or an mRNA basis; these are pooled. No Nla, there was no suitable Nlalll site in the 3' region of the gene, and so there are no SAGE mRNA data; No Met, 
the mature gene product contains no methionines, and so there are no reliable protein data. 
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TABLE 2. Functions of proteins listed in Table 1 



Name" 



YPD title lines 6 



Alcohol dehydrogenase I; cytoplasmic isozyme reducing acetaldehyde to ethanol, regenerating NAD 
Alcohol dehydrogenase II; oxidizes ethanol to acetaldehyde, glucose repressed 

Citrate synthase, peroxisomal (nonmitochondrial); converts acetyl-CoA and oxaloacetate to citrate plus CoA 
Enolase 1 (2-phosphoglycerate dehydratase); converts 2-phospho-D-glycerate to phosphoenol pyruvate in g yco ysis 
Enolase 2 (2-phosphoglycerate dehydratase); converts 2-phospho-D-glycerate to phosphoenolpyruvate in glycolysis 
Fructose bisphosphate aldolase II; sixth step in glycolysis 

Hexokinase I; converts hexoses to hexose phosphates in glycolysis; repressed by glucose 

Hexokinase II; converts hexoses to hexose phosphates in glycolysis and plays a regulatory role in glucose repression 
Isocitrate lyase, peroxisomal; carries out part of the glyoxylate cycle; required for gluconeogenesis 
Pyruvate dehydrogenase complex, El beta subunit 

Pyruvate decarboxylase isozyme 1 .... , , i i • 

Phosphofructokinase alpha subunit, part of a complex with Pfk2p which carries out a key regulatory step in glycolysis 
Glucose-6-phosphate isomerase, converts glucose-6-phosphate to fructose-6-phosphate 
Pyruvate carboxylase 1; converts pyruvate to oxaloacetate for gluconeogenesis 

Transaldolase; component of nonoxidative part of pentose phosphate pathway . , 

Glyceraldehyde-3-phosphate dehydrogenase 2; converts D-glyceraldehyde 3-phosphate to 1,3-dephosphog ycerate 
Glyceraldehyde-3-phosphate dehydrogenase 3; converts D-glyceraldehyde 3-phosphate to 1 ,3-dephosphoglycerate 
Triphosphate isomerase; interconverts glyceraldehyde-3-phosphate and dihydroxyacetone phosphate 

Translation elongation factor EF-l(i; GDP/GTP exchange factor for Teflp/Tef2p 

Translation elongation factor EF-2; contains diphthamide which is not essential for activity; identica to btt2p 
Translation elongation factor EF-2; contains diphthamide which is not essential for activity; identical to Eftlp 
Translation initiation factor eIF3 beta subunit (p90); has an RNA recognition domain 
Acidic ribosomal protein AO 

Translation initiation factor 4A (eIF4A) of the DEAD box family 
Translation initiation factor 4A (eIF4A) of the DEAD box family 
Translation elongation factor EF-3A; member of ATP-binding cassette superfamily 

Chaperonin homologous to E. coli HtpG and mammalian HSP90 

Mitochondrial chaperonin that cooperates with HsplOp; homolog of E coli GroEL 

Heat-inducible chaperonin homologous to E. coli HtpG and mammalian HSP90 m 

Heat shock protein required for induced thermotolerance and for resolubilizing aggregates of denatured proteins; important for Ipsi J- 

HM^d^S^nS^mdt^ltmk reticulum lumen required for protein translocation across the endoplasmic reticulum membrane 

and for nuclear fusion; member of the HSP70 family 
Cytoplasmic chaperone; heat shock protein of the HSP70 family 
Cytoplasmic chaperone; member of the HSP70 family 
Heat shock protein of HSP70 family involved in the translation^ apparatus 

Heat shock protein of HSP70 family, cytoplasmic . .. . . . 

Mitochondrial protein that acts as an import motor with Tim44p and plays a chaperonin role in receiving and folding of protein chains 

during import; heat shock protein of HSP70 family 
Heat shock protein of the HSP70 family; multicopy suppressor of mutants with hyperactivated Ras/cychc AMP pathway 
Stress-induced protein required for optimal growth at high and low temperature; has tetratncopeptide repeats 

Phosphoribosylamidoimidazole-succinocarboxamide synthase: catalyzes the seventh step in de novo purine biosynthesis pathway 
C, tetrahydrofolate synthase (trifunctional enzyme), cytoplasmic 

Phosphoribosvlamine-glycine ligase plus phosphoribosylformylglycinamidine cyclo-ligase; Afunctional protein 
Argininosuccinate lyase; catalyzes the final step in arginine biosynthesis 

Glutamate dehydrogenase (NADP + ); combines ammonia and a-ketoglutarate to form glutamate 

Glutamine synthetase; combines ammonia to glutamate in ATP-driven reaction . . . . . int . cfPrtC - 

Phosphpribos^l-AMP cyclohydrolase/phosphoribosyl-ATP pyrophosphohydrolase/histidinol dehydrogenase; 2nd, 3rd, and 10th steps of 

Ketol-a^ (acetohydroxy, acid reductoisomerase) (alpha-keto-{5-hydroxylacyl) reductoisomerase); second step in Val 

Sa^aropine 8 Shydrogena«TNADP + i L-glutamate forming) (saccharopine reductase), seventh step in lysine biosynthesis pathway 
Homocysteine methyltransferase; (5-methyltetrahydropteroyl triglutamate-homocysteine methyltransferase), methionine synthase, 
cobalamin independent 

-/-Glutamyl phosphate reductase (phosphoglutamate dehydrogenase), proline biosynthetic enzyme 
Phosphoserine transaminase; involved in synthesis of serine from 3-phosphoglycerate 
Tryptophan synthase, last (5th) step in tryptophan biosynthesis pathway 

Actin; involved in cell polarization, endocytosis, and other cytoskeletal functions 
Adenylate kinase (GTP:AMP phosphotransferase), cytoplasmic 
Cytosolic acetaldehyde dehydrogenase 

Beta subunit of Fl-ATP synthase; 3 copies are found in each Fl oligomer 
Homolog of mammalian 14-3-3 protein; has strong similarity to Bmh2p 
Homolog of mammalian 14-3-3 protein; has strong similarity to Bmhlp 

Protein of the AAA family of ATPases; required for cell division and homotypic membrane fusion 
Leucyl-tRNA synthetase, cytoplasmic . 
Farnesyl pyrophosphate synthetase; may be rate-limiting step in sterol biosynthesis pathway 
DL-Glycerol phosphate phosphatase 

Ran, a GTP-binding protein of the Ras superfamily involved in trafficking through nuclear pores 
Inorganic pyrophosphatase, cytoplasmic . . . . 

Component of serine C-palmitoyltransferase; first step in biosynthesis of long-chain base component of sphingolipids 
Thiamine-repressed protein essential for growth in the absence of thiamine 

Poly(A)-bincJin£ protein of cytoplasm ancfnucleus; part of the 3'-end RNA-processing complex (cleavage factor I); has 4 RNA 

recognition domains 
Mannose-1 -phosphate guanyltransferase; GDP-mannose pyrophosphorylase 
Ribonucleotide reductase small subunit 
S-Adenosylmethionine synthetase 1 
5-Adenosylmethionine synthetase 2 
Copper-zinc superoxide dismutase 
Ubiquitin-activating (El) enzyme 
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FIG. 2. Correlation of protein abundance with adjusted mRNA abundance. 
The number of molecules per cell of each protein is plotted against the number 
of molecules per cell of the cognate mRNA, with an r p of 0.76. Note the 
logarithmic axes. Data for mRNA were taken from references 27 and 30 and 
combined as described in Materials and Methods: 



correlation, confirming a strong correlation between CAI and 
protein abundance (Fig. 3). The relationship between CAI and 
protein abundance is log linear from about 1,000,000 to about 
10,000 molecules per cell. We have no data for rarer proteins. 

It is not clear whether CAI reflects maximum or average 
levels of protein expression. The proteins used for the CAI- 
protein correlation included some proteins which were not 
expressed at maximum levels under the condition of the ex- 
periment (Hsc82, Hspl04, Ssal, Adel, Arg4, His4, and others). 
When these proteins were removed from consideration and 
the correlation between CAI and the remaining (presumably 
constitutive) proteins was recalculated, the r s was essentially 
unchanged (not shown). 

The equation describing the graph in Fig. 3 is log (protein 
molecules/cell) = (2.3 X CAI) + 3.7. Thus, under certain 
conditions (a CAI of 0.3 or greater; a constitutively expressed 
gene), a very rough estimate of protein abundance can be 
made by raising 10 to the power of [(2.3 X CAI) + 3.7]. 

The distribution of CAI over the genome (Fig. 4) consists of 
a lower, bell-shaped distribution, possibly indicating a region 
where there is no selection for codon bias, and an upper, flat 
distribution, starting at a CAI of about 0.3, possibly indicating 
a region where there is selection for codon bias. Almost all of 
the proteins whose abundance we have measured are in the 
upper, flat portion of the distribution. In the lower, bell-shaped 
region, we do not know whether there is a correlation between 
CAI and protein abundance. 

Changes in protein abundance in glucose and ethanol. A 
comparison of cells grown in glucose (Fig. 1A) with cells grown 
in ethanol (Fig. IB) is shown in Table 1. As is well known, 
some proteins are induced tremendously during growth on 
ethanol. Two striking examples are the peroxisomal enzymes 
Icll (isocitrate lyase) and Cit2 (citrate synthase), which are 
induced in ethanol by more than 100- and 12-fold, respectively 
(Fig. 1; Table 1). These enzymes are key components of the 
glyoxylate shunt, which diverts some acetyl coenzyme A 
(acetyl-CoA) from the tricarboxylic acid cycle to gluconeogen- 
esis. S. cerevisiae requires large amounts of carbohydrate for its 
cell wall; in ethanol medium, this carbohydrate comes from 
gluconeogenesis, which depends on the glyoxylate shunt and 
on the glycolytic pathway running in reverse. The need for 



gluconeogenesis also explains why glycolytic enzymes are 
abundant even in ethanol medium. Thus, 2D gel analysis shows 
the prominence of the glycolytic and glyoxylate shunt enzymes 
in cells grown on ethanol, emphasizing that gluconeogenesis, 
presumably largely for production of the cell wall, is a major 
metabolic activity under these conditions. 

During gluconeogenesis, substrate-product relationships are 
reversed for the glycolytic enzymes. One might expect that not 
all glycolytic enzymes would be well adapted to the reverse 
reaction. Indeed, 2D gels show that in ethanol, Adh2 (alcohol 
dehydrogenase 2) is strongly induced (16), while its isozyme 
Adhl is not greatly affected. Adhl and Adh2 each interconvert 
acetaldehyde and ethanol. Adhl has a relatively high K m for 
ethanol (17 mM), while Adh2 has a lower K m (0.8 mM) (5). 
Thus, it is thought that Adhl is specialized for glycolysis (ac- 
etaldehyde to ethanol), while Adh2 is specialized for respira- 
tion (ethanol to acetaldehyde) (5, 29). Similarly, Enol (enolase 
1) is induced in ethanol, while its isozyme Eno2 (enolase 2) 
decreases in abundance (Table 1) (4, 19). Enol is inhibited by 
2-phosphoglycerate (the glycolytic substrate), while Eno2 is 
inhibited by phosphoenolpyruvate (the gluconeogenic sub- 
strate) (4). Perhaps Enol has a lower K, n for phosphoenol- 
pyruvate than does Eno2, though to our knowledge this has not 
been tested. Thus, the 2D gels distinguish isozymes specialized 
for growth on glucose (Adhl and Eno2) from isozymes spe- 
cialized for ethanol (Adh2 and Enol). 

Many heat shock proteins (e.g., Hsp60, Hsp82, Hspl04, and 
Kar2) were about twofold more abundant in ethanol medium 
than in glucose medium. This is consistent with the increased 
heat resistance of cells grown in ethanol (3). 

Enzymes involved in protein synthesis (Eftl, RpaO, and Tifl) 
were about twice as abundant in glucose medium as in ethanol 
medium. This may reflect the higher growth rate of the cells in 
glucose. 

Phosphorylation of proteins. To examine protein phosphor- 
ylation, we labeled cells with 32 P and ran 2D gels to examine 
phosphoproteins. About 300 distinct spots, probably represent- 
ing 150 to 200 proteins, could be seen on pH 4-8 gels (Fig. 5B). 
We then aligned autoradiograms of three gels, each with a 
different kind of labeled protein ( 32 P only [Fig. 5B], 32 P plus 
35 S [Fig. 5A], and 35 S only [not shown, but see Fig. 1 for 
example]). In this way, we made provisional identification of 
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some of the 32 P-labeled spots as particular 35 S-labeled spots. 
All such identifications are somewhat uncertain, since precise 
alignments are difficult, and of course multiple spots may ex- 
actly comigrate. Nevertheless, we believe that most of the 
provisional identifications are probably correct. Among the 
major 32 P-labeled proteins are the hexokinases Hxkl and 
Hxk2, the acidic ribosome-associated protein RpaO, the trans- 
lation factors Yef3 and Efbl, and probably Hsp70 heat shock 
proteins of the Ssa and Ssb families. RpaO and Efbl are quan- 
titatively monophosphorylated. 

Many yeast proteins resolve into multiple spots on these 2D 
gels (7). Yef3 has five or more spots, at least four of which 
comigrate with 32 P. Tpil has a major spot showing no 32 P 
labeling and a minor, more acidic spot which overlaps with 
some 32 P label. Tifl has at least seven spots (7); two of these 
overlap with some 32 P label, but five do not (Fig. 5). Eftl has 
at least three spots (7), and none of these overlap with 32 P, 
although there are three nearby, unidentified 32 P-labeled spots 
(a, c, and d in Fig. 5). Spots that seem to be extra forms of 
Met6, Pdcl, Eno2, and Fbal can be seen in Fig. 6A, but there 
is little 32 P at these positions in Fig. 5. Thus, phosphorylation 
explains some but not all of the different protein isoforms seen. 

The cell cycle is regulated in part by phosphorylation. We 
compared 32 P-labeled proteins from cells synchronized in Gj 
with a-factor, in cells synchronized in G x by depletion of G £ 
cyclins, and in cells synchronized in M phase with nocodazole. 
Only very minor differences were seen, and these were difficult 
to reproduce. The cell cycle proteins regulated by phosphory- 
lation may not be abundant enough for this technique to be 
applied easily. 

Centrifugal fractionation. We fractionated 35 S-labeled ex- 
tracts by centrifugation (Materials and Methods). Figure 6A 
shows the proteins in the supernatant of a high-speed 
(100,000 X g, 30 min) centrifugation, while Fig. 6B shows the 
proteins in the pellet of a low-speed (16,000 X g, 10 min) 
centrifugation. Many proteins are tremendously enriched in 
one fraction or the other, while others are present in both. 



Most glycolytic enzymes (e.g., Tdh2, Tdh3, Eno2, Pdcl, Adhl, 
and Fbal) are enriched in the supernatant fraction. The only 
exception is Pfkl (not indicated), which is found in both pellet 
and supernatant fractions. Many proteins involved in protein 
synthesis (Eftl, Yef3, Prtl, Tifl, and RpaO) are in the pellet, 
possibly because of the association of ribosomes with the en- 
doplasmic reticulum. However, Efbl is in the supernatant, as is 
a substantial portion of the Eftl. Perhaps surprisingly, several 
mitochondrial proteins (Atp2 [not shown] and Ilv5) are largely 
in the supernatant. Perhaps glass bead breakage of cells re- 
leases mitochondrial proteins. The nuclear protein Gspl is in 
the pellet fraction. The enrichment produced by centrifugation 
makes it possible to see minor spots which are otherwise poorly 
resolved from surrounding proteins. Figure 6B shows that the 
previously identified Tifl spot is surrounded by as many as six 
other spots that cofractionate. We observed six identical or 
very similar additional spots when we overexpressed Tifl from 
a high-copy-number plasmid (not shown). Signal overlaps only 
one or two of these spots in 32 P-labeling experiments (Fig. 5), 
and so the different forms are not mainly due to different 
phosphorylation states. 

DISCUSSION 

Our experience with developing a 2D gel protein database 
for S. cerevisiae is summarized here. With current technology, 
we can see the most abundant 1,200 proteins, which is about 
one-third to one-quarter of the proteins expressed. The re- 
maining proteins will be difficult to see and study with the 
methods that we have used, not because of a lack of sensitivity 
but because weak spots are covered by nearby strong spots. 

Of the 1,200 proteins seen, we have identified 148, with a 
bias toward the most abundant proteins. Steady application of 
the methods already used would allow identification of most of 
the remaining proteins. Gene overexpression will be particu- 
larly useful, since it is not affected by the lower abundance of 
the remaining visible proteins. 




FIG. 5. Phosphorylated proteins. (A) Mixture of 32 P-labeIed proteins and 35 S-Iabeled proteins. Two separate labeling reactions were done, one with 32 P and one 
with 35 S, and extracts were mixed and run on a 2D gel. Spots marked with numbers rather than gene names represent spots noted on 35 S gels but unidentified. Spots 
labeling with 32 P were identified' by (i) increased labeling compared to the 35 S-only gel (not shown); (ii) the characteristic fuzziness of a 32 P-labeled spot; and (iii) the 
decay of signal intensity seen on exposures made 4 weeks later (not shown). A minor form of Tpil and at least six minor forms of Tifl have been noted in overexpression 
experiments (see also Fig. 6B); positions of the minor forms are indicated by circles. (B) 32 P-only labeling. The major form of Tpil, which is not labeled with 32 P, is 
indicated by a large circle; positions of seven forms of Tifl are indicated by smaller circles. 



7365 



A 



115 kDa 




C0C48 



HSC82 



TDH2 



> f !j^, sl ppA( BMH2 



YMRIlSc 



TPtf TPll' 



, ^YKL056C 



16 kDa 



B 



115 kDa ! 



6.7 




4.3 



UBAI 



CDC48 



HSC82 



, I! I ■■ > ■ S SB 

* « * PGI1 PDCI- /PU8I 

5520'"" *85&0,(VMA2?) 

. + 1 ; ACT! 



/T0H3 

4 



T0H2 



PSA1 



MOLI 

tp>pp:r 



mm 



RPLAO 



|,GSPI 



16 kDa 



6.7 



pH 



4.3 



FIG. 6. Fractionation by centrifugation. (A) Proteins in the supernatant of a 100,000 X g, 30-min spin; proteins in the pellet of a 16,000 x g, 10-min spin. Supernatant 
fractions examined in multiple experiments done over a wide range of g forces looked similar to each other, as did the pellet fractions. 



7366 



Vol. 19, 1999 



SAMPLING OF THE YEAST PROTEOME 7367 



2D gels of the kind that we have used are not suitable for 
visualization of rare proteins. However it will be possible to 
study on a global basis metabolic processes involving relatively 
abundant proteins, such as protein synthesis, glycolysis, glu- 
coneogenesis, amino acid synthesis, cell wall synthesis, nucle- 
otide synthesis, lipid metabolism, and the heat shock response. 

Gygi et al. (10) have recently completed a study similar to 
ours. Despite generating broadly similar data, Gygi et al. 
reached markedly different conclusions. We believe that both 
mRNA abundance and codon bias are useful predictors of 
protein abundance. However, Gygi et al. feel that mRNA 
abundance is a poor predictor of protein abundance and that 
"codon bias is not a predictor of either protein or mRNA 
levels" (10). These different conclusions are partly a matter of 
viewpoint. Gygi et al. focus on the fact that the correlations of 
mRNA and codon bias with protein abundance are far from 
perfect, while we focus on the fact that, considering the wide 
range of mRNA and protein abundance and the undoubted 
presence of other mechanisms affecting protein abundance, 
the correlations are quite good. 

However, the different conclusions are also partly due to 
different methods of statistical analysis and to real differences 
in data. With respect to statistics, Gygi et al. used the Pearson 
product-moment correlation coefficient (r p ) to measure the 
covariance of mRNA and protein abundance. Depending on 
the subset of data included, their r p values ranged from 0.1 to 
0.94. Because of the low r p values with some subsets of the 
data, Gygi et al. concluded that the correlation of mRNA to 
protein was poor. However, the r p correlation is a parametric 
statistic and so requires variates following a bivariate normal 
distribution; that is, it would be valid only if both mRNA and 
protein abundances were normally distributed. In fact, both 
distributions are very far from normal (data not shown), and so 
a calculation of r p is inappropriate. There was no statistical 
backing for the assertion that codon bias fails to predict pro- 
tein abundance. 

We have taken two statistical approaches. First, we have 
used the Spearman rank correlation coefficient (r s ). Since this 
statistic is nonparametric, there is no requirement for the data 
to be normally distributed. Using the r s , we find that mRNA 
abundance is well correlated with protein abundance (r s = 
0.74), and the CAI is also well correlated with protein abun- 
dance (r s = 0.80) (and also with mRNA abundance [data not 
shown]). For the data of Gygi et al. (10), we obtained similar 
results, though with their data the correlation is not as good; r s 
= 0.59 for the mRNA-to-protein correlation, and r s = 0.59 for 
the codon bias-to-protein correlation. 

In a second approach, we transformed the mRNA and pro- 
tein data to forms where they were normally distributed, to 
allow calculation of an r p (Materials and Methods). Two trans- 
formations, Box-Cox and logarithmic, were used; both gave 
good correlations with our data [e.g., r = 0.76 for log( adjusted 
RNA) to log(protein)]. We were not able to transform the data 
of Gygi et al. to a normal distribution. 

Finally, there are also some differences in data between the 
two studies. These may be partly due to the different measure- 
ment techniques used: Gygi et al. measured protein abundance 
by cutting spots out of gels and measuring the radioactivity in 
each spot by scintillation counting, whereas we used phospho- 
rimaging of intact gels coupled to image analysis. We com- 
pared our data to theirs for the proteins common between the 
studies (but excluding proteins whose mRNAs are known to 
differ between rich and minimal media, and excluding Tifl, 
which was anomalous in differing by 100-fold between the two 
data sets). The r s between the two protein data sets was 0.88 
(P < 0.0001). Although this is a strong correlation, the fact that 



it is less than 1.0 suggests that there may have been errors in 
measuring protein abundance in one or both studies. After 
normalizing the two data sets to assume the same amount of 
protein per cell, we found a systematic tendency for the protein 
abundance data of Gygi et al. to be slightly higher than ours for 
the highest-abundance proteins and also for the lowest-abun- 
dance proteins but slightly lower than ours for the middle- 
abundance proteins. These systematic differences suggest some 
systematic errors in protein measurement. Although we do not 
know what the errors are, we suggest the following as a rea- 
sonable speculation. For the highest-abundance proteins, we 
may have underestimated the amount of protein because of a 
slightly nonlinear response of the phosphorimager screens. For 
the lowest-abundance proteins, Gygi et al. may have overesti- 
mated the amount of protein because of difficulties in accu- 
rately cutting very small spots out of the gel and because of 
difficulties in background subtraction for these small, weak 
spots. The difference in the middle abundance proteins may be 
a consequence of normalization, given the two errors above. 

The low-abundance proteins in the data set of Gygi et al. 
have a poor correlation with mRNA abundance. We calculate 
that the r s is 0.74 for the top 54 proteins of Gygi et al. but only 
0.22 for the bottom 53 proteins, a statistically significant dif- 
ference. However, with our data set, the r s is 0.62 for the top 33 
proteins and 0.56 (not significantly different) for the bottom 33 
proteins (which are comparable in abundance to the bottom 
53 proteins of Gygi et al.). Thus, our data set maintains a good 
correlation between mRNA and protein abundance even at 
low protein abundance. This is consistent with our speculation 
that protein quantification by phosphorimaging and image 
analysis may be more accurate for small, weak spots than is 
cutting out spots followed by scintillation counting. Our rela- 
tively good correlations even for nonabundant proteins may 
also reflect the fact that we used both SAGE data and RNA 
hybridization data, which is most helpful for the least abundant 
mRNAs. In summary, we feel that the poor correlation of 
protein to mRNA for the nonabundant proteins of Gygi et al. 
may reflect difficulty in accurately measuring these nonabun- 
dant proteins and mRNAs, rather than indicating a truly poor 
correlation in vivo. It is not surprising that observed correla- 
tions would be poorer with less-abundant proteins and 
mRNAs, simply because the accuracy of measurement would 
be worse. 

How well can mRNA abundance predict protein abun- 
dance? With r p = 0.76 for logarithmically transformed mRNA 
and protein data, the coefficient of determination, (r p ) 2 , is 0.58. 
This means that more than half (in log space) of the variation 
in protein abundance is explained by variation in mRNA abun- 
dance. When converted back to arithmetic values, protein 
abundances vary over about 200-fold (Table 1), and (r p ) 2 = 
0.58 for the log data means that of this 200-fold variation, 
about 20-fold is explained by variation in the abundance of 
mRNA and about 10-fold is unexplained (but could be due 
partly to measurement errors). For proteins much less abun- 
dant than those considered here, we imagine the in vivo cor- 
relation between mRNA and protein abundance will be worse, 
and other regulatory mechanisms such as protein turnover will 
be more important. 

Some important conclusions can be drawn from this sam- 
pling of the proteome. First, there is an enormous range of 
protein abundance, from nearly 2,000,000 molecules per cell 
for some glycolytic enzymes to about 100 per cell for some cell 
cycle proteins (26a). Second, about half of all cellular protein 
is found in fewer than 100 different gene products, which are 
mostly involved in carbohydrate metabolism or protein synthe- 
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sis. Third, the correlation between protein abundance and CAI 
is log linear as far as we can see, which is from about 10,000 
protein molecules per cell to about 1,000,000. This is somewhat 
surprising, because it implies that selective forces for codon 
bias are significant even at moderate expression levels. It also 
means that codon bias is a useful predictor of protein abun- 
dance even for moderately low bias proteins. Fourth, there is a 
good correlation between protein abundance and mRNA 
abundance for the proteins that we have studied. This validates 
the use of mRNA abundance as a rough predictor of protein 
abundance, at least for relatively abundant proteins. Fifth, for 
these abundant proteins, there are about 4,000 molecules of 
protein for each molecule of mRNA. This last conclusion 
raises questions as to how the levels of nonabundant proteins 
are regulated and suggests that protein instability, regulated 
translation, suboptimal rates of translation, and other mecha- 
nisms in addition to transcriptional control may be very impor- 
tant for these proteins. 
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